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The present invention 
is a substantially purified 
sortase-transamidase enzyme from 
Gram-positive bacteria, such as 
Staphylococcus aureus. The enzyme 
having a molecular weight of about 
4 1 ,000 daltons and catalyzing a 
reaction that covalently cross-links 
the carboxyl terminus of a protein 
having a sorting signal to the 
pepudoglycan of a Gram-positive 
bacterium, the sorting signal having: 
(1) a motif of LPX3X4G therein; (2) 
a substantially hydrophobic domain 
of at least 31 amino acids carboxyl 
to the motif; and (3) a charged tail 
region with at least two positively 
charged residues carboxyl to the 
substantially hydrophobic domain, 
at least one of the two positively 
charged residues being arginine, 
the two positively charged residues 
being located at residues 31-33 
from the motif, wherein X3 is any 
of the twenty naturally-occurring 
L-amino acids and X4 is selected 
from the group consisting of alanine, 
serine, and threonine, and wherein 
sorting occurs by cleavage between 
the fourth and fifth residues of the 
LPX3X4G motif. Variants of the enzyme, methods for cloning 
of use of the enzyme, including for screening for antibiotics 
bacteria, are also disclosed. 
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IDENTIFICATION OF SORTASE GENE 

GOVERNMENT RIGHTS 

This invention was supported by grants from the United States 
government, namely grants from the National Institutes of Health, NIH-NIAID Grant 
Nos. AI 33985 and 38897. Accordingly, the government may have certain rights in 
this invention. 

5 

BACKGROUND OF THE INVENTION 
This invention is directed to an enzyme from Gram-positive bacteria, 
designated sortase-transamidase, nucleic acid segments encoding the enzyme, and 
methods of use of the enzyme. 

10 Human infections caused by Gram-positive bacteria present a medical 

challenge due to the dramatic increase in multiple antibiotic resistance strains in 
recent years. Gram-positive bacteria that can cause serious or fatal infections in 
humans include Staphylococcus, Streptococcus, Enterococcus, Pneumococcus, 
Bacillus, Actinomyces, Mycobacterium, and Listeria, as well as others. Infections 

15 caused by these pathogens are particularly severe and difficult to treat in 

immunologically compromised patients. These include patients suffering from 
infection with the Human Immunodeficiency Virus (HIV), the virus that causes AIDS, 
as well as patients given immune-suppressive agents for treatment of cancer or 
autoimmune diseases. In particular, infections caused by various Mycobacterium 

20 species, including M. tuberculosis, M. bovis, M avium, and M. intracellular e, are 
frequently the cause of disease in patients with AIDS. 

Therefore, it is apparent that new target sites for bacterial 
chemotherapy are needed if such pathogenic organisms are to be controlled. 

A unique characteristic of these pathogens and many Gram-positive 

25 bacteria is their surface display of proteins anchored to the cell wall. In fact, many of 
these molecules are known to be involved in essential cellular functions, including 
pathogenesis in a susceptible host. Thus, a possible disruption in this anchoring 
process may prove to be an effective treatment against these disease-causing 
elements. 

30 The anchoring of surface molecules to the cell wall in Gram-positive 

bacteria has been demonstrated to involve a conserved pathway, culminating in 
recognition of a conserved cleavage/anchoring site by some previously 
uncharacterized cellular machinery. Molecules whose ultimate location is the cell 
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wall must invariably be translocated across the single cellular membrane of these 
organisms. This is mediated for all cell wall anchored proteins by the well studied 
secretory pathway, involving cleavage of an amino-terminal signal peptide by a type I 
signal peptidase. Upon translocation of the molecule out of the cytoplasm, a 
5 mechanism must be present that extracellularly recognizes this protein as a substrate 
for anchoring. This process has been previously shown to involve the carboxyl- 
terminally located cell wall sorting signal, consisting of a highly conserved motif such 
as LPXTG (SEQ ID NO: 1), in which X can represent any of the twenty naturally 
occurring L-amino acids, followed by a series of hydrophobic residues and ultimately 

10 a sequence of positively-charged residues. Thus, once amino-terminally modified 
and successfully secreted, a polypeptide with this carboxyl-terminal sequence can 
present itself as a substrate to be processed by the anchoring machinery. At this time, 
cleavage of the sorting signal after the threonine residue is coupled with covalent 
linkage of the remainder of the polypeptide to the free amino group of the 

1 5 pentaglycine crossbridge in the cell wall. 

It is this transpeptidation reaction that anchors mature surface proteins 
so that the peptidoglycan layer, from which point the molecules can serve their 
biological functions. Therefore, there is a need to isolate and purify the enzyme that 
catalyzes this reaction. There is also a need to identify the gene encoding such an 

20 enzyme in order that the enzyme can be produced by genetic engineering techniques. 

Additionally, there is also a need to develop new methods for 
displaying proteins or peptides on the surfaces of bacteria. For many purposes, it is 
desirable to display proteins or peptides on the surfaces of bacteria so that the proteins 
or peptides are accessible to the surrounding solution, and can, for example, be bound 

25 by a ligand that is bound specifically by the protein or peptide. In particular, the 
display of proteins on the surface of bacteria is desirable for the preparation of 
vaccines, the linkage of molecules such as antibiotic molecules or diagnostic reagents 
to cells, for screening reagents such as monoclonal antibodies, and for the selection of 
cloned proteins by displaying the cloned proteins, then observing their reaction with 

30 specific reagents such as antibodies. One way of doing this has been with phage 
display (G.P. Smith, "Filamentous Fusion Phage: Novel Expression Vectors that 
Display Cloned Antigens on the Virion Surface," Science 228:1315-1316 (1985)). 
However, phage display is limited in its practicality, because it requires that the 
protein being displayed to be inserted into a coat protein of filamentous phage and 

35 retain its activity while not distorting the conformation of the coat protein, allowing 
functional virions to be formed. In general, this technique is therefore limited only to 
small peptide and proteins. 
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Therefore, there is a need for a more general method of peptide and 
protein display. 

SUMMARY 

5 The present invention is directed to sortase-transamidase enzymes 

from Gram-positive bacteria, particularly Staphylococcus aureus, and methods for 
their use, particularly in the areas of drug screening and peptide and protein display. 

One aspect of the present invention is a substantially purified sortase- 
transamidase enzyme from a Gram-positive bacterium, the enzyme catalyzing a 

10 reaction that covalently cross-links the carboxyl terminus of a protein having a sorting 
signal to the peptidoglycan of a Gram-positive bacterium, the sorting signal having a 
motif of LPX3X4G therein, wherein sorting occurs by cleavage between the fourth and 
fifth residues of the LPX3X4G motif. Typically, the Gram-positive bacterium is a 
species selected from the group consisting of Staphylococcus aureus, S. sobrinus, 

15 Enterococcus faecalis, Streptococcus pyogenes, and Listeria monocytogenes. 
Preferably, the Gram-positive bacterium is S. aureus. The enzyme may be a 
heterooligomer. 

Preferably, the enzyme has at least one subunit with a molecular weight 
of about 41,000 daltons and the sorting signal further includes: (2) a substantially 

20 hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and (3) a 
charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 

25 and X4 is selected from the group consisting of alanine, serine, and threonine. 

Preferably, the enzyme includes therein a subunit whose amino acid 
sequence is selected from the group consisting of: (1) D-P-K-L-K-E-I-Y-Q-I-V- 
L-E-S-Q-M-K-A-I-N-E-I-R-P-G-M-T-G-A-E-A-D-A-I-S-R-N-Y-L- 
E-S-K-G-Y-G-K-E-F-G-H-S-L-G-H-G-I-G-L-E-I-H-E-G-P-M-L-A- 

30 R-T-I-Q-D-K-L-Q-V-N-N-C-V-T-V-E-P-G-V-Y-I-E-G-L-G-I-R-I-E- 
D-D-I-L-I-T-E-N-G-C-Q-V-F-T-K-C-T-K-D-L-I-V-L-T (SEQ ID NO: 
2); (2) M-V-K-V-T-D-Y-S-N-S-K-L-G-K-E-I-A-P-E-V-L-S-V-I-A-S- 
I-A-T-S-E-V-E-G-I-T-G-H-F-A-E-L-K-E-T-N-L-E-K-V-S-R-K-N-L- 
S-R-D-L-K-I-E-S-K-E-G-I-Y-I-D-V-Y-C-A-L-K-H-G-V-N-I-S-K-T- 

35 A-N-K-I-Q-T-S-I-F-N-S-I-S-N-M-T-A-I-E-P-K-Q-I-N-I-H-I-T-Q-I- 
V-I-E-K (SEQ ID NO: 31) and (3) sequences incorporating one or more 
conservative amino acid substitutions in SEQ ID NO:2 or SEQ ID NO: 3 1 , wherein 
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the conservative amino acid substitutions are any of the following: (1) any of 
isoleucine, leucine, and valine for any other of these amino acids; (2) aspartic acid for 
glutamic acid and vice versa; (3) glutamine for asparagine and vice versa; and (4) 
serine for threonine and vice versa. 
5 Another aspect of the present invention is a nucleic acid sequence 

encoding this enzyme. In one alternative, the nucleic acid sequence includes therein a 
sequence selected from the group consisting of: (1) 

GATCCTAAACTGAAAGAAATATATCAAATAGTACTTGAATCTCAAATGAA 
AGCAATTAATGAGATTAGACCTGGCATGACTGGTGCAGAAGCTGATGCCA 

10 TTTCAAGAAACTATTTAGAGTCAAAAGGGTATGGAAAAGAATTTGGACAT 
TCACTAGGACATGGTATTGGTTTAGAAATCCATGAAGGGCCAATGCTGGC 
TCGTACGATACAAGATAAACTTCAAGTTAACAACTGTGTTACAGTAGAAC 
CTGGTGTTTATATAGAAGGTTTGGGCGGTATAAGAATAGAAGATGATATT 
TTAATTACAGAAAATGGTTGTCAAGTCTTTACTAAATGCACAAAAGACCTT 

1 5 ATAGTTTTAACATAA (SEQ ID NO: 28); (2) 

ATGGTCAAAGTAACTGATTATTCAAATTCAAAATTAGGTAAAGTAGAAAT 
AGCGCCAGAAGTGCTATCTGTTATTGCAAGTATAGCTACTTCGGAAGTCG 
AAGGCATCACTGGCCATTTTGCTGAATTAAAAGAAACAAATTTAGAAAAA 
GTTAGTCGTAAAAATTTAAGCCGTGATTTAAAAATCGAGAGTAAAGAAGA 

20 TGGCATATATATAGATGTATATTGTGCATTAAAACATGGTAATATTTCAAA 
AACTGCAAACAAAATTCAAACGTCAATTTTTAATTCAATTTCT 
AGCGATAGAACCTAAGCAAATTAATATTCACATTACACAAATCGTTATTG 
AAAAGTAA (SEQ ID NO: 30); and (3) a sequence complementary to SEQ ID NO: 
28 or SEQ ID NO: 30. In another alternative, the nucleic acid sequence can include a 

25 sequence hybridizing with SEQ ID NO: 28, SEQ ID NO: 30 or a sequence 

complementary to SEQ ID NO: 28 or SEQ ID NO: 30 with no greater than about a 
15% mismatch under stringent conditions. Preferably, the degree of mismatch is less 
than about 5%; more preferably, the degree of mismatch is less than about 2%. 

Yet another aspect of the present invention is a vector comprising the 

30 nucleic acid sequence of the present invention operatively linked to at least one 
control sequence that controls the expression or regulation of the nucleic acid 
sequence. 

Yet another aspect of the present invention is a host cell transfected 
with a vector of the present invention. 
35 Another aspect of the present invention is a method for producing a 

substantially purified sortase-transamidase enzyme. The method comprises the steps 
of: 
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(1) culturing a host cell according to the present invention under 
conditions in which the host cell expresses the encoded sortase-transamidase enzyme; 
and 

(2) purifying the expressed enzyme to produce substantially purified 
5 sortase-transamidase enzyme. 

Another aspect of the present invention is a method for screening a 
compound for anti-sortase-transamidase activity. This method is important in 
providing a way to screen for antibiotics that disrupt the sorting reaction and are likely 
to be effective in treating infections caused by Gram-positive bacteria. 
10 In one alternative, the screening method comprises the steps of: 

(1) providing a substantially purified sortase-transamidase enzyme 
according to the present invention; 

(2) performing an assay for sortase-transamidase in the presence and in 
the absence of the compound; and 

15 (3) comparing the activity of the sortase-transamidase enzyme in the 

presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

In another alternative, the screening method comprises the steps of: 

(1) providing an active fraction of sortase-transamidase enzyme from 
20 a Gram-positive bacterium; 

(2) performing an assay for sortase-transamidase in the presence and in 
the absence of the compound; and 

(3) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 

25 transamidase activity. 

The active fraction of sortase-transamidase activity can be a particulate 
fraction from Staphylococcus aureus. 

The assay for sortase-transamidase enzyme can be performed by 
monitoring the capture of a soluble peptide that is a substrate for the enzyme by its 

30 interaction with an affinity resin. In one alternative, the soluble peptide includes a 
sequence of at least six histidine residues and the affinity resin contains nickel. In 
another alternative, the soluble peptide includes the active site of glutathione S- 
transferase and the affinity resin contains glutathione. In yet another alternative, the 
soluble peptide includes the active site of streptavidin and the affinity resin contains 

35 biotin. In still another alternative, the soluble peptide includes the active site of 
maltose binding protein and the affinity resin contains amylose. 
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Still another aspect of the present invention is an antibody specifically 
binding the sortase-transamidase enzyme of the present invention. 

Yet another aspect of the present invention is a protein molecule 
comprising a substantially purified sortase-transamidase enzyme according to the 
5 present invention extended at its carboxyl-terminus with a sufficient number of 
histidine residues to allow specific binding of the protein molecule to a nickel- 
sepharose column through the histidine residues added at the carboxyl-terminus. 

Still another aspect of the present invention is a method for displaying 
a polypeptide on the surface of a Gram-positive bacterium comprising the steps of: 
10 (1 ) expressing a polypeptide having a sorting signal at its carboxy- 

terminal end, the sorting signal having: (a) a motif of LPX3X4G therein; (b) a 
substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(c) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
1 5 being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 

(2) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) a substantially purified sortase-transamidase according to the present 

20 invention; and (iii) a Gram-positive bacterium having a peptidoglycan to which the 
sortase-transamidase can link the polypeptide; and 

(3) allowing the sortase-transamidase to catalyze a reaction that 
cleaves the polypeptide within the LPX3X4 motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 

25 peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 

Another display method according to the present invention comprises: 

(1 ) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 

30 carboxyl-terminal sorting signal as described above; 

(2) growing the bacterium into which the nucleic acid segment has 
been cloned to express the cloned chimeric protein to generate a chimeric protein 
including therein a carboxyl-terminal sorting signal; and 

(3) binding the polypeptide covalently to the cell wall by the enzymatic 
35 action of a sortase-transamidase expressed by the Gram-positive bacterium involving 

cleavage of the chimeric protein within the LPX3X4G motif so that the polypeptide is 
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displayed on the surface of the Gram-positive bacterium in such a way that the 
polypeptide is accessible to a ligand. 

Another aspect of the present invention is a polypeptide displayed on 
the surface of a Gram-Positive bacterium by covalent linkage of an amino-acid 
5 sequence of LPX3X4 derived from cleavage of an LPX3X4G motif, wherein X 3 is any 
of the twenty naturally-occurring L-amino acids and X4 is selected from the group 
consisting of alanine, serine, and threonine, the polypeptide being displayed on the 
surface of the Gram-positive bacterium in such a way that the polypeptide is 
accessible to a ligand. 
10 Another aspect of the present invention is a covalent complex 

comprising: 

(1) the displayed polypeptide; and 

(2) an antigen or hapten covalently cross-linked to the polypeptide. 
Yet another aspect of the present invention is a method for vaccination 

15 of an animal comprising the step of immunizing the animal with the displayed 
polypeptide to generate an immune response against the displayed polypeptide, or, 
alternatively, with the covalent complex to generate an immune response against the 
antigen or the hapten. 

Still another aspect of the present invention is a method for screening 

20 for expression of a cloned polypeptide comprising the steps of: 

(1) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end as described above; 

(2) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) a substantially purified sortase-transamidase enzyme according to the 

25 present invention; and (iii) a Gram-positive bacterium having a peptidoglycan to 
which the sortase-transamidase can link the polypeptide through the sorting signal; 

(3) binding the chimeric protein covalently to the cell wall by the 
enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 

30 that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(4) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

35 Still another aspect of the present invention is a method for the 

diagnosis or treatment of a bacterial infection caused by a Gram-positive bacterium 
comprising the steps of: 
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(1) conjugating an antibiotic or a detection reagent to a protein 
including therein a carboxyl-terminal sorting signal as described above to produee a 
conjugate; and 

(2) introducing the conjugate to an organism infected with a Gram- 

5 positive bacterium in order to cause the conjugate to be sorted and covalently cross- 
linked to the cell walls of the bacterium in order to treat or diagnose the infection. 

If an antibiotic is used, typically it is a penicillin, ampicillin, 
vancomycin, gentamicin, streptomycin, a cephalosporin, amikacin, kanamycin, 
neomycin, paromomycin, tobramycin, ciprofloxacin, clindamycin, rifampin, 

10 chloramphenicol, norfloxacin, or a derivative of these antibiotics. 

Similarly, another aspect of the present invention is a conjugate 
comprising an antibiotic or a detection reagent covalently conjugated to a protein 
including therein a carboxyl-terminal sorting signal as described above to produce a 
conjugate. In still another aspect of the present invention, a composition comprises 

15 the conjugate with a pharmaceutical^ acceptable carrier. 

Another aspect of the present invention is a substantially purified 
protein having at least about 50% match with best alignment with the amino acid 
sequences of at least one of the putative Bacillus peptidase (SEQ ID NO: 3), the 
aminopeptidase P of Lactococcus lactis (SEQ ID NO: 4), or the proline dipeptidase of 

20 Lactobacillus delbrueckii lactis (SEQ ID NO: 5) and having sortase-transamidase 
activity. Preferably, the match is at least about 60% in best alignment; more 
preferably, the match is at least about 70% in best alignment. 

Another aspect of the present invention is a substantially purified 
protein having sortase-transamidase activity and a hydrophobicity profile of at least 

25 one subunit of the protein that, determined as the mean absolute value of the 

hydrophobicity difference per residue, differs from the hydrophobicity profile of a 
putative Bacillus peptidase (SEQ ID NO: 3) by no more than about 2 units on the 
hydrophobicity scale. Preferably, the difference is not more than about 1 unit; most 
preferably, it is not more than about 0.5 units. 

30 Another aspect of the present invention is a substantially purified 

protein having sortase-transamidase activity and a hydrophobicity profile of at least 
one subunit of the protein that, determined as the mean absolute value of the 
hydrophobicity difference per residue, differs from the hydrophobicity profile of the 
sequence of SEQ ID NO: 31 by no more than about 2 units on the hydrophobicity 

35 scale. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



These and other features, aspects, and advantages of the present 
invention will become better understood with reference to the following description 
5 and accompanying drawings where: 

Figure 1 is a diagram showing the substantial homology of the amino 
acid sequence of the sortase-transamidase enzyme of Staphylococcus aureus to an 
open reading frame in the genome of Streptococcus pyogenes (SEQ ID NO: 2 & 34); 

Figure 2 is a diagram showing a greater homology of the amino acid 
10 sequence of the sortase-transamidase enzyme of Staphylococcus aureus to the 

carboxyl-terminal portion of the open reading frame in the genome of Streptococcus 
pyogenes (SEQ ID NO: 2 & 34); 

Figure 3 is the DNA sequence of the S. pyogenes open reading frame 
(SEQ ID NO: 33&34); 
15 Figure 4 (SEQ ID NO: 34) is the entire amino acid sequence of the 

protein translated from the entire 5. pyogenes open reading frame; 

Figure 5 (SEQ ID NO: 3) is the amino acid sequence of a putative 
Bacillus peptidase in the GCVT-SPOIIIAA intergenic region; 

Figure 6 is the hydrophobicity profile of the protein whose amino acid 
20 sequence is shown in Figure 5 (SEQ ID NO: 3); 

Figure 7 (SEQ ID NO: 4) is the amino acid sequence of the 
aminopeptidase P of Lactococcus lactis; 

Figure 8 (SEQ ID NO: 5) is the amino acid sequence of the proline 
dipeptidase of Lactobacillus delbrueckii lactis; 
25 Figure 9 is a diagram of the activity of the sortase-transamidase 

enzyme of the present invention; 

Figure 10 (SEQ ID NOS: 28 & 29) is a partial DNA sequence of the 
gene for one of the subunits of the sortase-transamidase einzyme of S. aureus; 

Figure 1 1 (SEQ ID NO: 2) is the partial carboxyl-terminal amino acid 
30 sequence translated from the DNA sequence of Figure 1 0 (SEQ ID NOS: 28 & 29); 

Figure 12 (SEQ ID NOS: 30 & 3 1) is a partial DNA sequence of the 
gene for a second of the subunits of the sortase-transamidase enzyme of S. aureus; 
and 

Figure 13 is the hydrophobicity profile of the protein translated from 
35 the DNA sequence of Figure 12 (SEQ ID NOS: 30 & 31). 
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DEFINITIONS 

As used herein, the terms defined below have the following meanings 
unless otherwise indicated: 

"Nucleic Acid Sequence": the term "nucleic acid sequence" includes 
5 both DNA and RNA unless otherwise specified, and, unless otherwise specified, 
includes both double-stranded and single-stranded nucleic acids. Also included are 
hybrids such as DNA-RNA hybrids. In particular, a reference to DNA includes RNA 
that has either the equivalent base sequence except for the substitution of uracil and 
RNA for thymine in DNA, or has a complementary base sequence except for the 
1 0 substitution of uracil for thymine, complementarity being determined according to the 
Watson-Crick base pairing rules. Reference to nucleic acid sequences can also 
include modified bases as long as the modifications do not significantly interfere 
either with binding of a ligand such as a protein by the nucleic acid or with Watson- 
Crick base pairing. 

15 "Antibody": as used herein the term "antibody" includes both intact 

antibody molecules of the appropriate specificity, and antibody fragments (including 
Fab, F(ab'), Fv, and F(ab') 2 ), as well as chemically modified intact antibody 
molecules and antibody fragments, including hybrid antibodies assembled by in vitro 
reassociation of subunits. Also included are single-chain antibody molecules 

20 generally denoted by the term sFv and humanized antibodies in which some or all of 
the originally non-human constant regions are replaced with constant regions 
originally derived from human antibody sequences. Both polyclonal and monoclonal 
antibodies are included unless otherwise specified. Additionally included are 
modified antibodies or antibodies conjugated to labels or other molecules that do not 

25 block or alter the binding capacity of the antibody. 

DESCRIPTION 

A substantially purified sortase-transamidase enzyme from Gram- 
positive bacteria, particularly Staphylococcus aureus. 
30 The properties of this enzyme make it a logical target for antibiotic action. This 
enzyme also catalyzes covalent crosslinkage of proteins to the peptidoglycan of 
Gram-positive bacteria. 

I. THE SORTASE-TRANSAMIDASE ENZYME 
35 One aspect of the invention is a substantially purified sortase- 

transamidase enzyme from a Gram-positive bacterium. As used herein, the term 
"substantially purified" means having a specific activity of at least tenfold greater than 
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the sortase-transamidase activity present in a crude extract, lysate, or other state from 
which proteins have not been removed and also in substantial isolation from proteins 
found in association with sortase-transamidase in the cell. 

One subunit of the enzyme has a molecular weight of about 4 1 ,000 
5 daltons. The enzyme catalyzes a reaction that covalently crosslinks the carboxyl- 
terminus of a protein having a sorting signal to the peptidoglycan of the GTam- 
positive bacterium. The sorting signal has: (1) a motif of LPX3X4G therein; (2) a 
substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 

10 substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 3 1-33 
from the motif. In this sorting signal, X 3 can be any of the twenty naturally-occurring 
L-amino acids. X4 can be alanine, serine, or threonine. Preferably, X4 is threonine. 

The sortase-transamidase is believed to occur in all Gram-positive 

15 bacteria. In particular, the enzyme exists in Mycobacterium, Nocardia, Actinomyces, 
Staphylococcus, Streptococcus, Listeria, Enterococcus, and Pneumococcus. 
Specifically, the enzyme exists in the following species: Staphylococcus aureus, S. 
sobrinus, Enterococcus faecalis, Streptococcus pyogenes, and Listeria 
monocytogenes. 

20 Preferably the enzyme is isolated from Staphylococcus aureus. 

A. Amino Acid Sequence 

The sortase-transamidase of the present invention includes therein an 
amino acid sequence, in one subunit of the enzyme, of D-P-K-L-K-E-I-Y-Q-I- 
V-L-E-S-Q-M-K-A-I-N-E-I-R-P-G-M-T-G-A-E-A-D-A-I-S-R-N-Y- 

25 L-E-S-KH3-Y-G-K-E-F-G-H-S-L-G-H-G-I-G-L-E-I-H-E-G-P-M-L- 
A-R-P-I-Q-D-K-L-Q-V-N-N-C-V-T-V-E-P-G-V-Y-I-E-G-L-G-G-I- 
R-I-E-D-D-I-L-I-T-E-N-G-C-Q-V-F-T-K-C-T-K-D-L-I-V-L-T (SEQ 
ID NO:2). This sequence is at the carboxyl-terminal end of the subunit of the 
enzyme. 

30 The sortase-transamidase of the present invention also includes therein 

an amino acid sequence, in a second subunit of the enzyme, of M-V-K-V-T-D-Y- 
S-N-S-K-L-G-K-E-I-A-P-E-V-L-S-V-I-A-S-I-A-T-S-E-V-E-G-I-T- 
G-H-F-A-E-L-K-E-T-N-L-E-K-V-S-R-K-N-L-S-R-D-L-K-I-E-S-K- 
E-G-I-Y-I-D-V-Y-C-A-L-K-H-G-V-N-I-S-K-T-A-N-K-I-Q-T-S-I-F- 

35 N-S-I-S-N-M-T-A-I-E-P-K-Q-I-N-I-H-I-T-Q-I-V-FE-K (SEQ ID NO: 
31). 
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Also within the scope of the present invention are substantially purified 
protein molecules that are mutants of the sequence of SEQ ID NO:2 or SEQ ID NO: 
3 1 that preserve the sortase-transamidase activity. In particular, the conservative 
amino acid substitutions can be any of the following: (1) any of isoleucine, leucine, 
5 and valine for any other of these amino acids; (2) aspartic acid for glutamic acid and 
vice versa; (3) glutamine for asparagine and vice versa; and (4) serine for threonine 
and vice versa. 

Other substitutions can also be considered conservative, depending 
upon the environment of the particular amino acid. For example, glycine (G) and 

10 alanine (A) can frequently be interchangeable, as can be alanine and valine (V). 

Methionine (M), which is relatively hydrophobic, can frequently be interchanged with 
leucine and isoleucine, and sometimes with valine. Lysine (K) and arginine (R) are 
frequently interchangeable in locations in which the significant feature of the amino 
acid residue is its charge and the different pK's of these two amino acid residues or 

15 their different sizes are not significant. Still other changes can be considered 
"conservative" in particular environments. For example, if an amino acid on the 
surface of a protein is not involved in a hydrogen bond or salt bridge interaction with 
another molecule, such as another protein subunit or a ligand bound by the protein, 
negatively charged amino acids such as glutamic acid and aspartic acid can be 

20 substituted for by positively charged amino acids such as lysine or arginine and vice 
versa. Histidine (H), which is more weakly basic than arginine or lysine, and is 
partially charged at neutral pH, can sometimes be substituted for these more basic 
amino acids. Additionally, the amides glutamine (Q) and asparagine (N) can 
sometimes be substituted for their carboxylic acid homologues, glutamic acid and 

25 aspartic acid. 

The sortase-transamidase from Staphylococcus aureus has substantial 
homology the amino acid sequence of the first subunit, that of SEQ ID NO : 2, with an 
open reading frame in the genome of Streptococcus pyogenes, particularly in the 
amino-terminal region. There is about a 22% match with best alignment over the 

30 entire sequenced region of the S. pyogenes open reading frame, and about a 47% 

match with best alignment over the carboxyl-terminal region of the 5. pyogenes open 
reading frame. These matches are shown in Figures 1-2. The DNA sequence of the 
entire S. pyogenes open reading frame is shown in Figure 3 (SEQ ID NO: 33 & 34). 
The protein translated from the entire S. pyogenes open reading frame has a molecular 

35 weight of about 40,85 1 .43 daltons; its sequence is shown in Figure 4 (SEQ ID NO: 
34). Therefore, another aspect of the present invention is a substantially purified 
protein molecule that has at least one subunit of about 40,000 to about 41,000 daltons 
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in molecular weight, the subunit having at least a 20% match with best alignment with 
the S. pyogenes open reading frame of Figure 2 and that has sortase-transamidase 
activity. Preferably, the subunit has at least a 30% match with best alignment; more 
preferably, at least 50% match with best alignment 
5 As shown below in the Example, the first 364 bases of a nucleic acid 

segment that complements a temperature-sensitive mutation in the S. aureus sortase- 
transamidase, designated the SM-317 complementing gene insert, has been identified 
as encoding a protein sequence that is a homologue of a putative Bacillus peptidase in 
the GCVT-SPOIIIAA intergenic region (GenBank Accession No. 1731048; Y. 

10 Kobayashi et al.). The sequence of this putative peptidase is shown in Figure 5 (SEQ 
ID NO:3) and its hydrophobicity profile is shown in Figure 6. Thehydrophobicity is 
calculated according to the method of J. Kyte & R.F. Doolittle, "A Simple Method for 
Displaying the Hydropathic Character of a Protein," J. Mol. Biol. 157: 105-132 
(1982). As used herein, the term "hydrophobicity" is the hydrophobicity as calculated 

15 in Kite & Doolittle, supra . 

To a lesser degree of homology, the protein sequence encoded by this 
complementing gene insert is homologous to aminopeptidase P of Lactococcus lactis 
(GenBank Accession No. 1915907; J. Matos). The amino acid sequence of this 
aminopeptidase is shown in Figure 7 (SEQ ID NO: 4). To a still lesser degree of 

20 homology, the protein sequence encoded by this complementing gene insert is 

homologous to the proline dipeptidase of Lactobacillus delbrueckii lactis (GenBank 
Accession No. 1 172066; K. Stucky et al., "Cloning and DNA Sequence Analysis of 
pepQ, a Prolidase Gene from Lactobacillus delbrueckii subsp. lactis and Partial 
Characterization of Its Product," Mol. Gen. Genet. 247: 494-500 (1995)). The amino 

25 acid sequence of this proline dipeptidase is shown in Figure 8 (SEQ ID NO:5). 

Because of the relatedness of these proteins, another aspect of the 
present invention is a substantially purified protein having at least one subunit with at 
least about 50% match with best alignment with the amino acid sequences of at least 
one of the putative Bacillus peptidase (SEQ ID NO: 3), the aminopeptidase P of 

30 Lactococcus lactis (SEQ ID NO: 4), or the proline dipeptidase of Lactobacillus 
delbrueckii lactis (SEQ ID NO: 5) and having sortase-transamidase activity. 
Preferably, the at least one subunit of the protein has at least about 60% match with 
best alignment with at least one of these sequence; more preferably, the at least one 
subunit of the protein has at least about 70% match with best alignment with at least 

35 one of these sequences. 

Because the hydrophobicity of a protein is a sensitive measure of 
protein structure, another aspect of the invention is a substantially purified protein 
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having sortase-transamidase activity and a hydrophobicity profile for at least one 
subunit of the protein that, determined as the mean absolute value of the 
hydrophobicity difference per residue, differs from the hydrophobicity profile of the 
putative Bacillus peptidase by no more than about 2 units on the hydrophobicity scale 
5 of Kyte & Doolittle, supra . Preferably, the difference is no greater than about 1 unit; 
more preferably, the difference is no greater than about 0.5 units. 

The sortase-transamidase is a cysteine protease. 

B. Activity of the Sortase-Transamidase 

10 The activity of the sortase-transamidase enzyme of the present 

invention is shown, in general, in Figure 9. The enzyme first cleaves a polypeptide 
having a sorting signal within the LPX3X4G motif. Cleavage occurs after residue X4, 
normally a threonine; as indicated above, this residue can also be a serine or alanine 
residue. This residue forms a covalent intermediate with the sortase. The next step is 

15 the transamidation reaction that transfers the cleaved carboxyl terminus of the protein 
to be sorted to the -NH 2 of the pentaglycine crossbridge within the peptidoglycan 
precursor. The peptidoglycan precursor is then incorporated into the cell wall by a 
transglycosylase reaction with the release of undecaprenyl phosphate. The mature 
anchored polypeptide chains are thus linked to the pentaglycine cross bridge in the cell 

20 wall which is tethered to the e-amino side chain of an unsubstituted cell wall 
tetrapeptide. A carboxypeptidase may cleave a D-Ala-D-Ala bond of the 
pentapeptide structure to yield the final branched anchor peptide in the staphylococcal 
cell wall. 

The sorting signal has: (1) a motif of LPX3X4G therein; (2) a 
25 substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region. 

In the motif, X3 can be any of the 20 naturally-occurring L-amino 
acids. X4 can be any of threonine, serine, or alanine. Preferably, X4 is threonine (O. 
Schneewind et al., "Cell Wall Sorting Signals in Surface Proteins of Gram-Positive 
30 Bacteria," EMBQ J. 12:4803-48 1 1 (1993)). 

Preferably, the substantially hydrophobic domain carboxyl to the motif 
includes no more than about 7 charged residues or residues with polar side chains. 
For the purposes of this specification, these residues include the following: aspartic 
acid, glutamic acid, lysine, and arginine as charged residues, and serine, threonine, 
35 glutamine, and asparagine as polar but uncharged residues. Preferably, the sequence 
includes no more than three charged residues. 
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Representative sequences suitable for sorting signals for use with the 
sortase-transamidase of the present invention include, but are not limited to the - 
following: E-E-N-P-FH-G-T-T»V-F-G-(3-L-S-L-A-L-G-A-A«L-L-A-G 
(SEQ ID NO: 6), the hydrophobic domain of the staphylococcal proteinase (SPA)* 
5 sorting signal from Staphylococcus aureus; (2) G-E-E-S-T-N-K-G-M-L-F-G- 
G-L-F-S-I-L-G-L-A-L-L (SEQ ID NO:7), the SNBP signal of & aureus; (3) D- 
S-S-N-A-Y-L-P-L-L-G-L-V-S-L-T-A-G-F-S-L-L-G-L (SEQ ID NO: 8), 
the SPAA signal of S sobrinus, (4) E-K-Q-N-V-L-L-T-V-V-G-S-L-A-A-M- 
L-G-L-A-G-L-G-F (SEQ ID NO:9), the PRGB signal of Enterococcus faecalis, 

10 (5) S-I-G-T-Y-L-F-K-I-G-S-A-A-M-I-G-A-I-G-I-Y-I-V (SEQ ID NO: 1 0), 
the TEE signal of Streptococcus pyogenes, and (6) D-S-D-N-A-L-Y-L-L-L-G- 
L-L-A-V-G-T-A-M-A-L-T (SEQ ID NO: 1 1 ), the INLA signal of Listeria 
monocytogenes. Other hydrophobic domains can be used as part of the sorting signal. 

The third portion of the sorting signal is a charged tail region with at 

15 least two positively charged residues carboxyl to the substantially hydrophobic 

domain. At least one of the two positively charged residues is arginine. The charged 
tail can also contain other charged amino acids, such as lysine. Preferably, the 
charged tail region includes two or more arginine residues. The two positively 
charged residues are located at residues 31-33 from the motif. Preferably, the two 

20 arginine residues are either in succession or are separated by no more than one 

intervening amino acid. Preferably, the charged tail is at least five amino acids long, 
although four is possible. Among the charged tails that can be used are the following: 
(1) R-R-R-E-L (SEQ ID NO:12), from the SPA signal of S aureus; (2) R-R-N-K- 
K-N-H-K-A (SEQ ID NO: 13), from the SNBP signal of S aureus; (3) R-R-K-Q- 

25 D (SEQ ID NO:14), from the SPAA signal of S sobrinus; (4) K-R-R-K-E-T-K 
(SEQ ID NO: 1 5), from the PRGB signal of £ faecalis; (5) K-R-R-K-A (SEQ ID 
NO: 16), from the TEE signal of S. pyogenes; (6), K-R-R-H-V-A-K-H (SEQ ID 
NO: 17), from the FIM sorting signal of Actinomyces viscosus, and (7) K-R-R-K-S 
(SEQ ID NO: 1 8), from the BAC sorting signal of Streptococcus aglactiae; (8) K-R- 

30 K-E-E-N (SEQ ID NO: 1 9), from the EMM signal of Streptococcus pyogenes. 

Also usable as the charged tail portion of the sorting signal are the 
following sequences produced by mutagenesis from the SPA signal of 5. aureus. 
These include R-R-R-E-S (SEQ ID NO: 20), R-R-R-S-L (SEQ ID NO: 21), R- 
R-S-E-L (SEQ ID NO: 22), R-S-R-E-L (SEQ ID NO: 23) and S-R-R-E-L (SEQ 

35 ID NO: 24). Other charged tails that are usable as part of the sorting signal can be 
derived from a poly serine tail, itself inactive, by replacement of one or more of the 
serine residues with the basic amino acid arginine. These include R-R-S-S-S (SEQ 
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ID NO: 25), R-S-R-S-S (SEQ ID NO:26) 5 and S-R-R-S-S (SEQ ID NO:27). 
Other sorting signals can also be used. 

II. THE GENE ENCODING THE SORTASE-TRANSAMIDASE ENZYME 
5 A. Isolation of t he Sortase- Transamidase Enzvme Gene 

The gene for the sortase-transamidase enzyme in Staphylococcus 
aureus has been isolated. The isolation process is described in detail in the Example 
below; in general, this process comprises: (1) the generation of temperature-sensitive 
mutants through chemical mutagenesis, such as with the DNA modifying agent N- 

10 methyl-N-nitro-N-nitrosoguanidine; (2) Screening for temperature-sensitive 

mutants; (3) screening the temperature-sensitive mutants for a block in protein sorting 
by the use of a construct harboring the staphylococcal enterotoxin B (SEB) gene fused 
to the cell wall sorting signal of staphylococcal Protein A (SPA), to locate mutants 
that accumulate a precursor molecule formed by cleavage of an amino-terminal signal 

15 peptide but that is not then processed by cleavage of the carboxyl-terminal sorting 
signal; (4) generation of a S. aureus chromosomal library and complementation of the 
temperature-sensitive sorting defect; and (5) sequencing and characterization of the S. 
aureus complementing determinants. 

20 B. Sequence of the Sortase-Transamidase Gene 

The above procedure yielded a partial sequence for one of the subunits 
of the sortase-transamidase including the carboxyl-terminal portion of the gene for 
the first subunit This sequence is 

GATCCTAAACTGAAAGAAATATATCAAATAGTACTTGAATCTCAAATGAA 
25 AGCAATTAATGAGATTAGACCTGGCATGACTGGTGCAGAAGCTGATGCCA 
TTTCAAGAAACTATTTAGAGTCAAAAGGGTATGGAAAAGAATTTGGACAT 
TCACTAGGACATGGTATTGGTTTAGAAATCCATGAAGGGCCAATGCTGGC 
TCGTACGATACAAGATAAACTTCAAGTTAACAACTGTGTTACAGTAGAAC 
CTGGTGTTTATAGAAGGTTTGGGCGGTATAAGAATAGAAGATGATATTTT 
30 AATTACAGAAAATGGTTGTCAAGTCTTTACTAAATGCACAAAAGACCTTA 
TAGTTTTAACATAA (SEQ ID NO:28 & 29). The last three nucleotides, TAA, of 
this sequence are the stop codon. 

The above procedure further yielded a sequence for a second subunit of 
ATGGTCAAAGTAACTGATTATTCAAATTCAAAATTAGGTAAAGTAGAAAT 
35 AGCGCCAGAAGTGCTATCTGTTATTGCAAGTATAGCTACTTCGGAAGTCG 
AAGGCATCACTGGCCATTTTGCTGAATTAAAAGAAACAAATTTAGAAAAA 
GTTAGTCGTAAAAATTTAAGCCGTGATTTAAAAATCGAGAGTAAAGAAGA 
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TGGCATATATATAGATGTATATTGTGCATTAAAACATGGTAATATTTCAAA 
AACTGCAAACAAAATTCAAACGTCAATTTTTAATTCAAm 
AGCGATAGAACCTAAGCAAATTAATATTCACATTACACAAATCGTTATTG 
AAAAGTAA (SEQ ID NO: 30 & 31) The last three nucleotides of this sequence, 

5 TAA, are the stop codon. 

Accordingly, within the scope of the present invention is a nucleic acid 
sequence encoding a substantially purified sortase-transamidase enzyme from a 
Gram-positive bacterium. The enzyme encoded has at least one subunit with a 
molecular weight of about 41,000 daltons and catalyzes a reaction that covalently 

1 0 cross-links the carboxyl-terminus of a protein having the sorting signal described 
above to the peptidoglycan of a gram-positive bacterium. The nucleic acid sequence 
includes therein the sequence of SEQ ID NO: 28 or a sequence complementary to 
SEQ ID NO: 28, or the sequence of SEQ ID NO: 30 or a sequence complementary to 
SEQ ID NO: 30. 

15 Also included within the present invention is a nucleic acid sequence 

encoding a substantially purified sortase-transamidase enzyme from a Gram-positive 
bacterium with at least one subunit with a molecular weight of about 4 1 ,000 daltons, 
where the enzyme catalyzes the cross-linking reaction where the nucleic acid 
sequence hybridizes with at least one of: (1) the sequence of SEQ ID NO: 28; (2) a 

20 sequence complementary to SEQ ID NO: 28; (3) the sequence of SEQ ID NO: 30; or 
(4) a sequence complementary to SEQ ID NO: 30 with no greater than about a 1 5% 
mismatch under stringent conditions. Preferably, the degree of mismatch is no greater 
than about 5%; most preferably the mismatch is no greater than about 2%. 

Also within the present invention is a nucleic acid sequence encoding a 

25 substantially purified sortase-transamidase enzyme from a Gram-positive bacterium 
where the enzyme has at least one subunit with a molecular weight of about 4 1 ,000 
daltons and catalyzes the cross-linking reaction described above involving the sorting 
signal, where the enzyme includes therein an amino acid sequence selected from the 
group consisting of: (1) D-P-K-L-K-E-I-Y-Q-I-V-L-E-S-Q-M-K-A-I-N-E- 

30 I-R-P-D-M-T-G-A-E-A-D-A-I-S-R-N-Y-L-E-S-K-G-Y-G-K-E-F-G- 
H-S-L-G-H-G-I-G-L-E-I-H-E-G-P-M-L-A-R-T-I-Q-D-K-L-Q-V-N- 
N-C-V-T-V-E-P-G-V-Y-I-E-G-L-G-I-R-I-E-D-D-I-L-I-T-E-N-G-C- 
Q-V-F-T-K-C-T-K-D-L-I-V-L-T (SEQ ID NO:2); (2) M-V-K-V-T-D-Y- 
S-N-S-K-L-G-K-E-I-A-P-E-V-L-S-V-I-A-S-I-A-T-S-E-V-E-G-I-T- 

35 G-H-F-A-E-L-K-E-T-N-L-E-K-V-S-R-K-N-L-S-R-D-L-K-I-E-S-K- 
E-G-I-Y-I-D-V-Y-C-A-L-K-H-G-V-N-I-S-K-T-A-N-K-I-Q-T-S-I-F- 
N-S-I-S-N-M-T-A-I-E-P-K-Q-I-N-I-H-I-T-Q-I-V-I-E-K <SEQ ID NO: 
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31); and (3) sequences incorporating one or more conservative amino acid 
substitutions in SEQ ID NO:2 or SEQ ID NO: 31 wherein the conservative amino acid 
substitutions are any of the following: (1) any of isoleucine, leucine and valine for any 
other of these amino acids; (2) aspartic acid for glutamic acid and vice versa; (3) 
5 glutamine for asparagine and vice versa; and (4) serine for threonine and vice versa. 
Alternative nucleic acid sequences can be determined using the standard genetic code; 
the alternative codons are readily determinable for each amino acid in this sequence. 

Construction of nucleic acid sequences according to the present 
invention can be accomplished by techniques well known in the art, including solid- 
10 phase nucleotide synthesis, the polymerase chain reaction (PCR) technique, reverse 
transcription of DNA from RNA, the use of DNA polymerases and ligases, and other 
techniques. If an amino acid sequence is known, the corresponding nucleic acid 
sequence can be constructed according to the genetic code. 

15 C. Vectors and Host Cells Transformed with Vectors 

Another aspect of the invention is a vector comprising a nucleic acid 
sequence according to the present invention operatively linked to at least one control 
sequence that controls the expression or regulation of the nucleic acid sequence. Such 
control sequences are well known in the art and include operators, promoters, 

20 enhancers, promotei^proximal elements and replication origins. The techniques of 
vector construction, including cloning, ligation, gap-filling, the use of the polymerase 
chain reaction (PCR) procedure, solid-state oligonucleotide synthesis, and other 
techniques, are all well known in the art and need not be described further here. 

Another aspect of the present invention is a host cell transfected with a 

25 vector according to the present invention. Among the host cells that can be used are 
gram-positive bacteria such as Staphylococcus aureus. 

Transfection, also known as transformation, is done using standard 
techniques appropriate to the host cell used, particularly Staphylococcus aureus. Such 
techniques are described, for example, in R.P. Novick, "Genetic Systems in 

30 Staphylococci," Meth. Enzvmol. 204: 587-636 ( 1 99 1 ), as well as in O. Schneewind et 
al., "Sorting of Protein A to the Staphylococcal Cell Wall," £eU 70: 267-281 (1992). 

III. SORTASE-TRANSAMIDASE AS A TARGET FOR ANTIBIOTIC ACTION 
A. A Site for Antibiotic Action 
35 The reaction carried out by the sortase-transamidase of the present 

invention presents a possible target for a new class of antibiotics to combat medically 
relevant infections caused by numerous gram-positive organisms. Because this is a 
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novel site of antibiotic action, these antibiotics have the advantage that resistance by 
the bacterium has not had a chance to develop. 

Such antibiotics can include compounds with structures that mimic the 
cleavage site, such as compounds with a structure similar to methyl 
5 methanethiosulfonate or, more generally, alkyl methanethiosulfonates. The sortase- 
transamidase of the present invention is believed to be a cysteine protease. Other 
antibiotics that may inhibit the activity of the sortase-transamidase in the present 
invention include inhibitors that would be specific for cysteine-modification in a {3- 
lactam framework. These inhibitors would have active moieties that would form 

10 mixed disulfides with the cysteine sulfhydryl. These active moieties could be 

derivatives of methanethiosulfonate, such as methanethiosulfonate ethylammonium, 
methanethiosulfonate ethyltrimethylammonium, or methanethiosulfonate 
ethylsulfonate (J.A. Javitch et al., "Mapping the Binding Site Crevice of the 
Dopamine D2 Receptor by the Substituted-Cysteine Accessibility Method," Neuron . 

15 14: 825-83 1 (1995); M.H. Akabas & A. Karlin, "Identification of Acetylcholine 
Receptor Channel-Lining Residues in the Ml Segment of the a-Subunit," 
Biochemistry 34: 12496-12500 (1995)). Similar reagents, such as alkyl 
alkanethiosulfonates, i.e., methyl methanethiosulfonate, or alkoxycarbonylalkyl 
disulfides, have been described (D.J. Smith et al., "Simple Alkanethiol Groups for 

20 Temporary Blocking of Sulfhydryl Groups of Enzymes." Biochemistry 14: 766-771 
(1975); W.N. Valentine & D.E. Paglia, "Effect of Chemical Modification of 
Sulfhydryl Groups of Human Erythrocyte Enzymes." Am. J. Hematol. 11: 1 1 1-124 
(1 98 1 )). Other useful inhibitors involve derivatives of 2-trifluoroacetylaminobenzene 
sulfonyl fluoride (J.C. Powers, "Proteolytic Enzymes and Their Active-Site-Specific 

25 Inhibitors: Role in the Treatment of Disease," in Modification of Proteins) , in a p- 
lactam framework, peptidyl aldehydes and nitriles (E. Dufour et al., "Peptide 
Aldehydes and Nitriles as Transition State Analog Inhibitors of Cysteine Proteases," 
Biochemistry 34: 9136-9143 (1995); J. O. Westerik & R. Wolfenden, "Aldehydes as 
Inhibitors of Papain," J. Biol. Chem. 247: 8195-8197 (1972)), peptidyl diazomethyl 

30 ketones (L. Bjorck et al., "Bacterial Growth Blocked by a Synthetic Peptide Based on 
the Structure of a Human Proteinase Inhibitor "Nature 337: 385-386 (1989)), 
peptidyl phosphonamidates (P.A. Bartlett & C.K. Marlowe, "Phosphonamidates as 
Transition-State Analogue Inhibitors of Thermolysin," Biochemistry 22: 4618-4624 
(1983)), phosphonate monoesters such as derivatives or analogues of m- 

35 carboxyphenyl phenylacetamidomethylphosphonate (R.F. Pratt, "Inhibition of a Class 
C p-Lactamase by a Specific Phosphonate Monoester," Science 246: 917-919 
(1989)), maleimides and their derivatives, including derivatives of such Afunctional 
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maleimides as o-phenylenebismaleimide, p-phenylenebismaleimide, m- 
phenylenebismaleimide, 2,3-naphthalenebismaleimide, 1,5- 
naphthalenebismaleimide, and azophenylbismaleimide, as well as monofunctional 
maleimides and their derivatives (J.V. Moroney et al., "The Distance Between Thiol 
5 Groups in the y Subunit of Coupling Factor 1 Influences the Proton Permeability of 
Thylakoid Membranes/' J. Bioenerget. Biomembr. 14: 347-359 (1982)), peptidyl 
halomethyl ketones (chloromethyl or fluoromethyl ketones), peptidyl sulfonium salts, 
peptidyl acyloxymethyl ketones, derivatives and analogues of epoxides, such as E-64 
(N-[N-(L-ti^is-carboxyoxiran-2-carbonyl)-L-leucylagmatine), E-64c (a 

10 derivative of E-64 in which the agmatine moiety is replaced by an isoamylamine 
moiety), E-64c ethyl ester, Ep-459 (an analogue of E-64 in which the agmatine 
moiety is replaced by a 1 ,4-diaminopropyl moiety), Ep-479 (an analogue of E-64 in 
which the agmatine moiety is replaced by a 1 ,7-diheptylamino moiety), Ep-460 (a 
derivative of Ep-459 in which the terminal amino group is substituted with a Z 

1 5 (benzyloxycarbonyl) group), Ep-1 74 (a derivative of E-64 in which the agmatine 
moiety is removed, so that the molecule has a free carboxyl residue from the leucine 
moiety), Ep-475 (an analogue of E-64 in which the agmatine moiety is replaced with 
a NH2"(CH 2 )2"CH-(CH 3 )2 moiety), or Ep-420 (a derivative of E-64 in which the 
hydroxyl group is benzoylated, forming an ester, and the leucylagmatine moiety is 

20 replaced with isoleucyl-O-methyltyrosine), or peptidyl O-acyl hydroxamates (E 
Shaw, "Cysteinyl Proteases and Their Selective Inactivation), pp 271-347). Other 
inhibitors are known in the art. 

B. Screening Methods 
25 Another aspect of the present invention is a method for screening a 

compound for anti-sortase— transamidase activity. This is an important aspect of the 
present invention, because it provides a method for screening for compounds that 
disrupt the sorting process and thus have potential antibiotic activity against Gram- 
positive bacteria. 

30 In general, this method comprises the steps of: (1) providing an active 

fraction of sortase-transamidase enzyme; (2) performing an assay for sortase- 
transamidase activity in the presence and in the absence of the compound being 
screened; and (3) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound. 

35 The active fraction of sortase-transamidase enzyme can be a 

substantially purified sortase-transamidase enzyme preparation according to the 



WO 99/09145 



PCT/US98/16229 



-21- 

present invention, but can be a less purified preparation, such as a partially purified 
particulate preparation as described below. 

The enzymatic activity can be measured by the cleavage of a suitable 
substrate, such as the construct having the Staphylococcal Enterotoxin B (SEB) gene 
5 fused to the cell wall sorting signal of Staphylococcal Protein A (SPA). The cleavage 
can be determined by monitoring the molecular weight of the products by sodium 
dodecyl sulfate-polyacrylamide gel electrophoresis or by other methods. 

One particularly preferred assay for sortase-transamidase activity is the 

following: 

10 Staphylococcal soluble RNA (sRNA) is prepared from S. aureus by a 

modification of the technique of Zubay (G. Zubay, J. Mol. Biol. 4: 347-356 (1962)). 
An overnight culture of S. aureus is diluted 1:10 in TSB and incubated at 37°C for 3 
hr. The cells are harvested by centrifugation at 6000 rpm for 15 min. 

For every gram of wet cell pellets, 2 ml of 0.0 1 M magnesium acetate, 

15 0.001 M Tris, pH 7.5 is used to suspend the pellets. The cell pellets are beaten by 
glass bead beater for 45 minutes in 5 minute intervals. The suspension is centrifiiged 
twice at 2500 rpm for 5 minutes to remove the glass beads, then 0.5 ml phenol is 
added to the suspension. The suspension is vigorously shaken for 90 minutes at 4°C, 
and then centrifiiged at 1 8,000 x g for 1 5 minutes. The nucleic acids in the top layer 

20 are precipitated by addition of 0. 1 volume of 20% potassium acetate and 2 volumes of 
ethanol, then stored at 4°C for at least 36 hours. The precipitate is obtained by 
centrifugation at 5,000 x g for 5 minutes. Cold NaCl (1 ml) is added to the precipitate 
and stirred at 4°C for 1 hour. The suspension is centrifiiged at 15,000 x g for 30 
minutes. The sediments are washed with 0.5 ml of cold 1 M NaCl. The supernatants 

25 are combined and 2 volumes of ethanol is added to precipitate the tRNA. The 

precipitate is suspended in 0.1 ml of 0.2 M glycine, pH 10.3 and incubated for 3 hr at 
37°C. This suspension is then made 0.4 M in NaCl and the RNA is precipitated by 
addition of 2 volumes of ethanol. The precipitate is dissolved in 0.7 ml of 0.3 M 
sodium acetate, pH 7.0. To this is slowly added 0.5 volume of isopropyl alcohol, with 

30 stirring. The precipitate is removed by centrifugation at 8,000 x g for 5 min. This 
precipitate is redissolved in 0.35 ml of 0.3 M sodium acetate, pH 7.0. To this is added 
0.5 volume of isopropyl alcohol, using the same procedure as above. The precipitate 
is also removed by centrifugation. The combined supernatants from the two 
centrifugations are treated further with 0.37 ml of isopropyl alcohol. The resulting 

35 precipitate is dissolved in 75 \il of water and dialyzed against water overnight at 4°C. 
This sRNA is used in the sortase-transamidase assay. 
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Particulate sortase-transamidase enzyme is prepared for use in the 
assay by a modification of the procedure of Chatterjee & Park (A.N. Chatterjee & J.T. 
Park, Proc. Natl. Acad. Sci. USA 51:9-16 (1964)). An overnight culture of 5. aureus 
OS2 is diluted 1 :50 in TSB and incubated at 37°C for 3 hr. Cells are harvested by 
5 centrifugation at 6000 rpm for 1 5 minutes, and washed twice with ice-cold water. 
The cells are disrupted by shaking 7 ml of 1 3% suspension of cells in 0.05 M Tris- 
HC1 buffer, pH 7.5, 0.1 mM MgCl 2 , and 1 mM 2-mercaptoethanol with an equal 
volume of glass beads for 10-15 minutes in a beater. The glass beads are removed by 
centrifugation at 2000 rpm for 5 minutes. The crude extract is then centrifuged at 
10 15,000 x g for 5 minutes. The supernatant is centrifuged again at 100,000 x g for 30 
minutes. The light yellow translucent pellet is resuspended in 2 to 4 ml of 0.02 M 
Tris-HCl buffer, pH 7.5, containing 0.1 mM MgCl 2 and 1 mM 2-mercaptoethanol. 
This suspension represents the crude particulate enzyme and is used in the reaction 
mixture below. 

1 5 The supernatant from centrifugation at 1 00,000 x g is passed through 

gel filtration using a Sephadex® G-25 agarose column (Pharmacia) to remove 
endogenous substrates. This supernatant is also used in the reaction mixture. 

The complete reaction mixture contains in a final volume of 30 nl (M. 
Matsuhashi et al., Proc. Natl. Acad. Sci. USA 54: 587-594 (1965)): 3 ^imol of Tris- 

20 HC1, pH 7.8; 0.1 ^imol of MgCl 2 ; 1.3 nmol of KC1; 2.7 nmol of [ 3 H] glycine (200 
(aCi/pinol); 2 nmol of UDP-M-pentapeptide; 5 nmol of UDP-N-acetylglucosamine; 
0.2 nmol of ATP; 0.05 nmol of potassium phosphoenolpyruvate; 2.05 \ig of 
chloramphenicol; 5 fig of pyruvate kinase; 0.025 jimol of 2-mercaptoethanol; 50 ng 
of staphylococcal sRNA prepared as above; 4 jig (as protein) of supernatant as 

25 prepared above; 271 ng of particulate enzyme prepared as above; and 8 nmol of a 
synthesized soluble peptide (HHHHHHAQALEPTGEENPF) (SEQ ID NO: 32) as a 
substrate. 

The mixture is incubated at 20°C for 60 minutes. The mixture is then 
heated at 100°C for 1 minute. The mixture is diluted to 1 ml and precipitated with 50 

30 \il nickel resin, and washed with wash buffer ( 1 % Triton X- 1 00, 0. 1 % sodium 
dodecyl sulfate, 50 mM Tris, pH 7.5), The nickel resin beads are counted in a 
scintillation counter to determine 3 H bound to the beads. 

The effectiveness of the compound being screened to inhibit the 
activity of the sortase-transamidase enzyme can be determined by adding it to the 

35 assay mixture in a predetermined concentration and determining the resulting degree 
of inhibition of enzyme activity that results. Typically, a dose-response curve is 
generated using a range of concentrations of the compound being screened. 
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The particulate enzyme preparation of sortase-transamidase employed 
in this protocol can be replaced with any other sortase-transamidase preparation, - 
purified or crude, staphylococcal, recombinant, or from any other source from any 
other Gram-positive bacterium as described above. 
5 The soluble peptide is captured in this embodiment by its affinity for 

nickel resin as a result of the six histidine residues. More than six histidine residues 
can be used in the peptide. As an alternative, the soluble peptide can be captured by 
an affinity resulting from other interactions, such as streptavidin-biotin, glutathione 
S-transferase-glutathione, maltose binding protein-amylose, and the like, by 

10 replacing the six histidine residues with the amino acid sequence that constitutes the 
binding site in the peptide and employing the appropriate solid phase affinity resin 
containing the binding partner. Suitable peptides can be prepared by solid phase 
peptide synthesis using techniques well known in the art, such as those described in 
M. Bodanszky, "Peptide Chemistry: A Practical Textbook" (2d ed., Springer-Verlag, 

15 Berlin, 1993). For example, if the glutathione S-transferase-glutathione interaction is 
used, the active site of glutathione S-transferase (D.B. Smith & K.S. Johnson, 
"Single-Step Purification of Polypeptides Expressed in Escherichia coli as Fusions 
with Glutathione S-Transferase," Gene 67: 3 1 -40 ( 1 988)) can be substituted for the 
six histidine residues, and glutathione can be bound to the solid support. 

20 

IV. USE OF SORTASE-TRANSAMIDASE FOR PROTEIN AND PEPTIDE 
DISPLAY 

A. Methods for Protein and Peptide Display 

The sortase-transamidase enzyme of the present invention can also be 
25 used in a method of displaying a polypeptide on the surface of a gram-positive 
bacterium. 

In general, a first embodiment of this method comprises the steps of: 
(1) expressing a polypeptide having a sorting signal at its carboxyl-terminal end as 
described above; (2) forming a reaction mixture including: (i) the expressed 

30 polypeptide; (ii) a substantially purified sortase-transamidase enzyme; and (iii) a 
Gram-positive bacterium having a peptidoglycan to which the sortase-transamidase 
can link the polypeptide; and (3) allowing the sortase-transamidase to catalyze a 
reaction that cleaves the polypeptide within the LPX3X4G motif of the sorting signal 
and covalently cross-links the amino-terminal portion of the cleaved polypeptide to 

35 the peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 
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In this method, the polypeptide having the sorting signal at its 
carboxy-terminal end need not be expressed in a Gram-positive bacterium; it can be 
expressed in another bacterial system such as Escherichia coli or Salmonella 
typhimurium, or in a eukaryotic expression system. 
5 The other method for protein targeting and display relies on direct 

expression of the chimeric protein in a Gram-positive bacterium and the action of the 
sortase-transamidase on the expressed protein. In general, such a method comprises 
the steps of: (1) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 

10 carboxyl-terminal sorting signal as described above, the chimeric protein including 
the polypeptide to be displayed; (2) growing the bacterium into which the nucleic acid 
segment has been cloned to express the cloned chimeric protein to generate a chimeric 
protein including therein a carboxyl-terminal sorting signal; and (3) covalent binding 
of the chimeric protein to the cell wall by the enzymatic action of the sortase- 

15 transamidase involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the protein is displayed on the surface of the gram-positive bacterium in such a 
way that the protein is accessible to a ligand. 

Typically, the Gram-positive bacterium is a species of Staphylococcus. 
A particularly preferred species of Staphylococcus is Staphylococcus aureus. 

20 However, other Gram-positive bacteria such as Streptococcus 

pyogenes, other Streptococcus species, and Gram-positive bacteria of other genera 
can also be used. 

Cloning the nucleic acid segment encoding the chimeric protein into 
the Gram-positive bacterium is performed by standard methods. In general, such 

25 cloning involves: (1) isolation of a nucleic acid segment encoding the protein to be 
sorted and covalently linked to the cell wall; (2) joining the nucleic acid segment to 
the sorting signal; (3) cloning by insertion into a vector compatible with the Gram- 
positive bacterium in which expression is to take place; and (4) incorporation of the 
vector including the new chimeric nucleic acid segment into the bacterium. 

30 Typically, the nucleic acid segment encoding the protein to be sorted is 

DNA; however, the use of RNA in certain cloning steps is within the scope of the 
present invention. 

When dealing with genes from eukaryotic organisms, it is preferred to 
use cDNA, because the natural gene typically contains intervening sequences or 

35 introns that are not translated. Alternatively, if the amino acid sequence is known, a 
synthetic gene encoding the protein to be sorted can be constructed by standard solid- 
phase oligodeoxyribonucleotide synthesis methods, such as the phosphotriester or 
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phosphite triester methods. The sequence of the synthetic gene is determined by the 
genetic code, by which each naturally occurring amino acid is specified by one or 
more codons. Additionally, if a portion of the protein sequence is known, but the 
gene or messenger RNA has not been isolated, the amino acid sequence can be used to 
5 construct a degenerate set of probes according to the known degeneracy of the genetic 
code. General aspects of cloning are described, for example, in J, Sambrook et al., 
"Molecular Cloning: A Laboratory Manual" (2d ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York, 1989); in B. Perbal, "A Practical Guide to 
Molecular Cloning" (2d ed., John Wiley & Sons, New York 1988), in S.L. Berger & 

10 A.R. Kimmel, "Guide to Molecular Cloning Techniques" (Methods in Enzymology, 
vol. 152, Academic Press, Inc., San Diego, 1987), and inD.V. Goeddel, ed., "Gene 
Expression Technology" (Methods in Enrymology, vol. 185, Academic Press, Inc., 
San Diego, 1991). 

Once isolated, DNA encoding the protein to be sorted is then joined to 

15 the sorting signal. This is typically accomplished through ligation, such as using 

Escherichia coli or bacteriophage T4 ligase. Conditions for the use of these enzymes 
are well known and are described, for example, in the above general references. 

The ligation is done in such a way so that the protein to be sorted and 
the sorting signal are joined in a single contiguous reading frame so that a single 

20 protein is produced. This may, in some cases, involve addition or deletion of bases of 
the cloned DNA segment to maintain a single reading frame. This can be done by 
using standard techniques. 

Cloning is typically performed by inserting the cloned DNA into a 
vector containing control elements to allow expression of the cloned DNA. The 

25 vector is then incorporated into the bacterium in which expression is to occur, using 
standard techniques of transformation or other techniques for introducing nucleic 
acids into bacteria. 

One suitable cloning system for & aureus places the cloned gene under 
the control of the BlaZRI regulon (P.Z. Wang et al., Nucl. Acids Res. 19:4000 

30 (1991)). Vectors and other cloning techniques for use in Staphylococcus aureus are 
described in B. Nilsson & L. Abrahmsen, "Fusion to Staphylococcal Protein A," in 
Gene Expression Technology, supra , p. 144-1 61. 

If the chimeric protein is cloned under control of the BlaZRI regulon, 
expression can be induced by the addition of the p-lactam antibiotic methicillin. 

35 Another aspect of the present invention is a polypeptide displayed on 

the surface of a Gram-positive bacterium by covalent linkage of an amino-acid 
sequence of LPX3X4 derived from cleavage of an LPX3X4G motif, as described above. 
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Yet another aspect of the present invention is a covalent complex 
comprising: (1) the displayed polypeptide; and (2) an antigen or hapten covalently 
cross-linked to the polypeptide. 



5 B. Screening Methods 

These polypeptides associated with the cell surfaces of Gram-positive 
bacteria can be used in various ways for screening. For example, samples of 
expressed proteins from an expression library containing expressed proteins on the 
surfaces of the cells can be used to screen for clones that express a particular desired 
10 protein when a labeled antibody or other labeled specific binding partner for that 
protein is available. 

These methods are based on the methods for protein targeting and 
display described above. 

A first embodiment of such a method comprises: (1) expressing a 
15 cloned polypeptide as a chimeric protein having a sorting signal at its carboxy- 
terminal end as described above; (2) forming a reaction mixture including: (i) the 
expressed chimeric protein; (ii) a substantially purified sortase-transamidase enzyme; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; (3) binding of the 
20 chimeric protein covalently to the cell wall by the enzymatic action of a sortase- 
transamidase expressed by the Gram-positive bacterium involving cleavage of the 
chimeric protein within the LPX3X4G motif so that the polypeptide is displayed on the 
surface of the Gram-positive bacterium in such a way that the polypeptide is 
accessible to a ligand; and (4) reacting the displayed polypeptide with a labeled 
25 specific binding partner to screen the chimeric protein for reactivity with the labeled 
specific binding partner. 

The nucleic acid segment encoding the chimeric protein is formed by 
methods well known in the art and can include a spacer. 

In the last step, the cells are merely exposed to the labeled antibody or 
30 other labeled specific binding partner, unreacted antibodies removed as by a wash, and 
label associated with the cells detected by conventional techniques such as 
fluorescence, chemiluminescence, or autoradiography. 

A second embodiment of this method employs expression in a Gram- 
positive bacterium that also produces a sortase-transamidase enzyme. This method 
35 comprises: (1) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 
carboxyl-terminal sorting signal as described above, the chimeric protein including 
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the polypeptide whose expression is to be screened; (2) growing the bacterium into 
which the nucleic acid segment has been cloned to express the cloned chimeric protein 
to generate a chimeric protein including therein a carboxyl-terminal sorting signal; (3) 
binding the polypeptide covalently to the cell wall by the enzymatic action of a 
5 sortase-transamidase expressed by the Gram-positive bacterium involving cleavage 
of the chimeric protein within the LPX3X4G motif so that the polypeptide is displayed 
on the surface of the Gram-positive bacterium in such a way that the polypeptide is 
accessible to a ligand; and (4) reacting the displayed polypeptide with a labeled 
specific binding partner to screen the chimeric protein for reactivity with the labeled 
1 0 specific binding partner. 

V. USE OF SORTED MOLECULES FOR DIAGNOSIS AND TREATMENT OF 
BACTERIAL INFECTIONS 

Sorted molecules can also be used for the diagnosis and treatment of 

15 bacterial infections caused by Gram-positive bacteria. Antibiotic molecules or 
fluorescent or any other diagnostic molecules can be chemically linked to a sorted 
peptide segment, which may include a spacer as described above, and then can be 
injected into animals or humans. These molecules are then sorted by the sortase- 
transamidase so that they are covalently linked to the cell wall of the bacteria. 

20 In general, these methods comprise: (1) conjugating an antibiotic or a 

detection reagent to a protein including therein a carboxyl-terminal sorting signal to 
produce a conjugate; and (2) introducing the conjugate to an organism infected with a 
Gram-positive bacterium in order to cause the conjugate to be sorted and covalently 
cross-linked to the cell walls of the bacterium in order to treat or diagnose the 

25 infection. 

The antibiotic used can be, but is not limited to, a penicillin, ampicillin, 
vancomycin, gentamicin, streptomycin, a cephalosporin, amikacin, kanamycin, 
neomycin, paromomycin, tobramycin, ciprofloxacin, clindamycin, rifampin, 
chloramphenicol, or norfloxacin, or a derivative of these antibiotics. 

30 The detection reagent is typically an antibody or other specific binding 

partner labeled with a detectable label, such as a radiolabel. Such methods are well 
known in the art and need not be described further here. 

Accordingly, another aspect of the present invention is a conjugate 
comprising an antibiotic or a detection reagent covalently conjugated to a protein 

35 including therein a carboxyl-terminal sorting signal as described above to produce a 
conjugate. 
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Yet another aspect of the present invention is a composition 
comprising the conjugate and a pharmaceutically acceptable carrier. 

In this context, the conjugates can be administered using conventional 
modes of administration, including, but not limited to, intravenous, intraperitoneal, 
5 oral, or intralymphatic. Other routes of administration can alternatively be used. Oral 
or intraperitoneal administration is generally preferred. The composition can be 
administered in a variety of dosage forms, which include, but are not limited to, liquid 
solutions or suspensions, tablets, pills, powders, suppositories, polymeric 
microcapsules or microvesicles, liposomes, and injectable or infusible solutions. The 

1 0 preferred form depends on the mode of administration and the quantity administered. 

The compositions for administration preferably also include 
conventional pharmaceutically acceptable carriers and adjuvants known in the art such 
as human serum albumin, ion exchangers, alumina, lecithin, buffered substances such 
as phosphate, glycine, sorbic acid, potassium sorbate, and salts or electrolytes such as 

15 protamine sulfate. The most effective mode of administration and dosage regimen for 
the conjugates as used in the methods in the present invention depend on the severity 
and course of the disease, the patient's health, the response to treatment, the particular 
strain of bacteria infecting the patient, other drugs being administered and the 
development of resistance to them, the accessibility of the site of infection to blood 

20 flow, pharmacokinetic considerations such as the condition of the patient's liver 
and/or kidneys that can affect the metabolism and/or excretion of the administered 
conjugates, and the judgment of the treating physician. According, the dosages should 
be titrated to the individual patient. 

25 VI. USE OF SORTED POLYPEPTIDES FOR PRODUCTION OF VACCINES 

Additionally, the sorted polypeptides covalently crosslinked to the cell 
walls of Gram-positive bacteria according to the present invention have a number of 
uses. One use is use in the production of vaccines that can be used to generate 
immunity against infectious diseases affecting mammals, including both human and 

30 non-human mammals, such as cattle, sheep, and goats, as well as other animals such 
as poultry and fish. This invention is of special importance to mammals. The 
usefulness of these complexes for vaccine production lies in the fact that the proteins 
are on the surface of the cell wall and are accessible to the medium surrounding the 
bacterial cells, so that the antigenic part of the chimeric protein is accessible to the 

35 antigen processing system. It is well known that presenting antigens in particulate 
form greatly enhances the immune response. In effect, bacteria containing antigenic 
peptides on the surfaces linked to the bacteria by these covalent interactions function 
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as natural adjuvants. Here follows a representative list of typical microorganisms that 
express polypeptide antigens against which useful antibodies can be prepared by the 
methods of the present invention: 

(1) Fungi: Candida albicans, Aspergillus fumigatus, Histoplasma 

5 capsulatum (all cause disseminating disease), Microsporum canis (animal ringworm). 

(2) Parasitic protozoa: (1) Plasmodium falciparum (malaria), 
Trypanosoma cruzei (sleeping sickness). 

(3) Spirochetes: (1) Borrelia bergdorferi (Lyme disease), Treponema 
pallidum (syphilis), Borrelia recurrentis (relapsing fever), Leptospira 

1 0 icterohaemorrhagiae Leptospirosis). 

(4) Bacteria: Neisseria gonorrhoeae (gonorrhea), Staphylococcus 
aureus (endocarditis), Streptococcus pyogenes (rheumatic fever), Salmonella typhosa 
(salmonellosis), Hemophilus influenzae (influenza), Bordetella pertussis (whooping 
cough), Actinomyces israelii (actinomycosis), Streptococcus mutans (dental caries), 

15 Streptococcus equi (strangles in horses), Streptococcus agalactiae (bovine mastitis), 
Streptococcus anginosus (canine genital infections). 

(5) Viruses: Human immunodeficiency virus (HIV), poliovirus, 
influenza virus, rabies virus, herpes virus, foot and mouth disease virus, psittacosis 
virus, paramyxovirus, myxovirus, coronavirus. 

20 Typically, the resulting immunological response occurs by both 

humoral and cell-mediated pathways. One possible immunological response is the 
production of antibodies, thereby providing protection against infection by the 
pathogen. 

This method is not limited to protein antigens. As discussed below. 

25 non-protein antigens or haptens can be covalently linked to the C-terminal cell-wall 
targeting segment, which can be produced as an independently expressed polypeptide, 
either alone, or with a spacer at its amino-terminal end. If a spacer at the amino- 
terminal end is used, typically the spacer will have a conformation allowing the 
efficient interaction of the non-protein antigen or hapten with the immune system ; 

30 most typically a random coil or cc-helical form. The spacer can be of any suitable 
length; typically, it is in the range of about 5 to about 30 amino acids; most typically, 
about 10 to about 20 amino acids. In this version of the embodiment, the 
independently expressed polypeptide, once expressed, can then be covalently linked to 
the hapten or non-protein antigen. Typical non-protein antigens or haptens include 

35 drugs, including both drugs of abuse and therapeutic drugs, alkaloids, steroids, 

carbohydrates, aromatic compounds, including many pollutants, and other compounds 
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that can be covalently linked to protein and against which an immune response can be 
raised. 

Alternatively, a protein antigen can be covalently linked to the 
independently expressed cell-wall targeting segment or a cell-wall targeting segment 
5 including a spacer. 

Many methods for covalent linkage of both protein and non-protein 
compounds to proteins are well known in the art and are described, for example, in P. 
Tijssen, "Practice and Theory of Enzyme Immunoassays" (Elsevier, Amsterdam, 
1985), pp. 221-295, and in S.S. Wong, "Chemistry of Protein Conjugation and 

10 Cross-Linking" (CRC Press, Inc., Boca Raton, FL, 1 993). 

Many reactive groups on both protein and non-protein compounds are 
available for conjugation. 

For example, organic moieties containing carboxyl groups or that can 
be carboxylated can be conjugated to proteins via the mixed anhydride method, the 

15 carbodiimide method, using dicyclohexylcarbodiimide, and the N- 
hydroxysuccinimide ester method. 

If the organic moiety contains amino groups or reducible nitro groups 
or can be substituted with such groups, conjugation can be achieved by one of several 
techniques. Aromatic amines can be converted to diazonium salts by the slow 

20 addition of nitrous acid and then reacted with proteins at a pH of about 9. If the 

organic moiety contains aliphatic amines, such groups can be conjugated to proteins 
by various methods, including carbodiimide, tolylene-2,4-diisocyanate, ormalemide 
compounds, particularly the N-hydroxysuccinimide esters of malemide derivatives. 
An example of such a compound is 4-(N-maleimidomethyl)-cyclohexane-l- 

25 carboxylic acid. Another example is m-maleimidobenzoyl-N-hydroxysuccinimide 
ester. Still another reagent that can be used is N-succinimidyl-3-(2-pyridyldithio) 
propionate. Also, Afunctional esters, such as dimethylpimelimidate, 
dimethyladipimidate, or dimethylsuberimidate, can be used to couple amino-group- 
containing moieties to proteins. 

30 Additionally, aliphatic amines can also be converted to aromatic 

amines by reaction with p-nitrobenzoylchloride and subsequent reduction to ag- 
aminobenzoylamide, which can then be coupled to proteins after diazotization. 

Organic moieties containing hydroxyl groups can be cross-linked by a 
number of indirect procedures. For example, the conversion of an alcohol moiety to 

35 the half ester of succinic acid (hemisuccinate) introduces a carboxyl group available 
for conjugation. The Afunctional reagent sebacoyldichloride converts alcohol to acid 
chloride which, at pH 8.5, reacts readily with proteins. Hydroxy 1-containing organic 
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moieties can also be conjugated through the highly reactive chlorocarbonates, 
prepared with an equal molar amount of phosgene. 

For organic moieties containing ketones or aldehydes, such carbonyl- 
containing groups can be derivatized into carboxyl groups through the formation of 
5 O-(carboxymethyl) oximes. Ketone groups can also be derivatized with p- 
hydrazinobenzoic acid to produce carboxyl groups that can be conjugated to the 
specific binding partner as described above. Organic moieties containing aldehyde 
groups can be directly conjugated through the formation of Schiff bases which are 
then stabilized by a reduction with sodium borohydride. 

10 One particularly useful cross-linking agent for hydroxyl-containing 

organic moieties is a photosensitive noncleavable heterobifunctional cross-linking 
reagent, sulfosuccinimidyl 6-[4'-azido-2'-nitrophenylamino] hexanoate. Other 
similar reagents are described in S.S. Wong, "Chemistry of Protein Conjugation and 
Cross-Linking, " supra . 

1 5 Other cross-linking reagents can be used that introduce spacers 

between the organic moiety and the specific binding partner. 

These methods need not be described further here. 

VII. PRODUCTION OF SUBSTANTIALLY PURIFIED SORTASE- 
20 TRANSAMIDASE ENZYME 

Another aspect of the present invention is methods for the production 
of substantially purified sortase-transamidase enzyme. 

A. Methods Involving Expression of Cloned Gene 
25 One method for the production of substantially purified sortase- 

transamidase enzyme involves the expression of the cloned gene. The isolation of the 
nucleic acid segment or segments encoding the sortase-transamidase enzyme is 
described above; these nucleic acid segment or segments are then incorporated into a 
vector and then use to transform a host in which the enzyme can be expressed. In one 
30 alternative, the host is a Gram-positive bacterium. 

The next step in this alternative is expression in a Gram-positive 
bacterium to generate the cloned sortase-transamidase enzyme. Expression is 
typically under the control of various control elements associated with the vector 
incorporating the DNA encoding the sortase-transamidase gene; such elements can 
35 include promoters and operators, which can be regulated by proteins such as 
repressors. The conditions required for expression of cloned proteins in gram- 
positive bacteria, particularly S. aureus, are well known in the art and need not be 
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further recited here. An example is the induction of expression of lysostaphin under 
control of the BlaZRI regulon induced by the addition of methicillin. 

When expressed in Staphylococcus aureus, the chimeric protein is 
typically first exported with an amino-terminal leader peptide, such as the 
5 hydrophobic signal peptide at the amino-terminal region of the cloned lysostaphin of 
Recsei et al. (P. Recsei et aL, "Cloning, Sequence, and Expression of the Lysostaphin 
Gene from Staphylococcus simulans? Proc. Natl. Acad. Sci. USA 84:1 127-1 131 
(1987)). 

Alternatively, the cloned nucleic acid segment encoding the sortase- 
10 transamidase enzyme can be inserted in a vector that contains sequences allowing 
expression of the sortase-transamidase in another organism, such as E. coli or S. 
typhimurium. A suitable host organism can then be transformed or transfected with 
the vector containing the cloned nucleic acid segment. Expression is then performed 
in that host organism. 

1 5 The expressed enzyme is then purified using standard techniques. 

Techniques for the purification of cloned proteins are well known in the art and need 
not be detailed further here. One particularly suitable method of purification is 
affinity chromatography employing an immobilized antibody tosortase. Other protein 
purification methods include chromatography on ion-exchange resins, gel 

20 electrophoresis, isoelectric focusing, and gel filtration, among others. 

One particularly useful form of affinity chromatography for 
purification of cloned proteins, such as sortase-transamidase, as well as other 
proteins, such as glutathione S-transferase and thioredoxin, that have been extended 
with carboxyl-terminal histidine residues, is chromatography on a nickel-sepharose 

25 column. This allows the purification of a sortase-transamidase enzyme extended at 
its carboxyl terminus with a sufficient number of histidine residues to allow specific 
binding of the protein molecule to the nickel-sepharose column through the histidine 
residues. The bound protein is then eluted with imidazole. Typically, six or more 
histidine residues are added; preferably, six histidine residues are added. One way of 

30 adding the histidine residues to a cloned protein, such the sortase-transamidase, is 
through PCR with a primer that includes nucleotides encoding the histidine residues. 
The histidine codons are CAU and CAC expressed as RNA, which are CAT and CAC 
as DNA. Amplification of the cloned DNA with appropriate primers will add the 
histidine residues to yield a new nucleic acid segment, which can be recloned into an 

35 appropriate host for expression of the enzyme extended with the histidine residues. 
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B. Other Methods 

Alternatively, the sortase-transamidase can be purified from Gram- 
positive bacteria by standard methods, including precipitation with reagents such as 
ammonium sulfate or protamine sulfate, ion-exchange chromatography, gel filtration 
5 chromatography, affinity chromatography, isoelectric focusing, and gel 
electrophoresis, as well as other methods known in the art. 

Because the sortase-transamidase is a cysteine protease, one 
particularly useful method of purification involves covalent chromatography by thiol- 
disulfide interchange, using a two-protonic-state gel containing a 2- 
10 mercaptopyridine leaving group, such as Sepharose 2B-glutathione 2-pyridyl 
disulfide or Sepharose 6B-hydroxypropyl 2-pyridyl disulfide. Such covalent 
chromatographic techniques are described in K. Brocklehurst et al., "Cysteine 
Proteases," in New Comprehensive Biochemistry. Volume 16: Hvdrolvtic Enzvmes 
(A. Neuberger & K. Brocklehurst, eds., Elsevier, New York, 1987), ch. 2, pp. 39-158. 

15 

VIII. FURTHER APPLICATIONS OF SORTASE-TRANSAMIDASF 

A. Production of Antibodies 

Antibodies can be prepared to the substantially purified sortase- 
transamidase of the present invention, whether the sortase-transamidase is purified 

20 from bacteria or produced from recombinant bacteria as a result of gene cloning 
procedures. Because the substantially purified enzyme according to the present 
invention is a protein, it is an effective antigen, and antibodies can be made by well- 
understood methods such as those disclosed in E. Harlow & D. Lane, "Antibodies: A 
Laboratory Manual" (Cold Spring Harbor Laboratory, 1988). In general, antibody 

25 preparation involves immunizing an antibody-producing animal with the protein, with 
or without an adjuvant such as Freund's complete or incomplete adjuvant, and 
purification of the antibody produced. The resulting polyclonal antibody can be 
purified by techniques such as affinity chromatography. 

Once the polyclonal antibodies are prepared, monoclonal antibodies 

30 can be prepared by standard procedures, such as those described in Chapter 6 of 
Harlow & Lane, supra . 

B. Derivatives for Affinity Chromatography 

Another aspect of the present invention is derivatives of the cloned, 
35 substantially purified sortase-transamidase of the present invention extended at its 
carboxyl terminus with a sufficient number of histidine residues to allow specific 
binding of the protein molecule to a nickel-sepharose column through the histidine 



WO 99/09145 



PCT/US98/16229 



-34- 

residues. Typically, six or more histidine residues are added; preferably, six histidine 
residues are added. 

The histidine residues can be added to the carboxyl terminus through 
PCR cloning as described above. 
5 This invention is further described by means of the following example. 

This Example is for illustrative purposes only, and are not to be construed as limiting 
the scope of the invention in any manner. 

EXAMPLE 

10 Identification of Sortase-Transamidase 

Generation of ts Mutants through Chemical Mutagenesis 

To create random mutations in the chromosome, Staphylococcus 
aureus strain OS2 (RN4220 erm spa-) was mutagenized by exposure to the DNA- 
modifying agent N-methyl-N-nitro-N-nitrosoguanidine. Cultures were incubated 

15 with the mutagen for varying periods of time, then placed on TSB agar plates to 

measure viability. Cultures were subsequently plated on TSB+ rifampicin (10 pg/ml) 
to determine the mutation frequency based on resistance to the single target site 
antibiotic. Once a maximum mutation frequency was reached, cell cultures were 
exposed to two successive rounds of a penicillin selection (5 \\%lm\ at 42°C) to enrich 

20 for mutants that had a growth effect by lysis of the cells growing at this temperature. 
Mutants were screened for growth at 30°C and 42°C by streaking individual colonies 
on TSB agar plates at the permissive and non-permissive temperatures, respectively. 
These colonies that demonstrated a growth defect at the non-permissive temperature 
were rechecked at 42°C, and subsequently frozen at -80°C in a 5% BSA, 5% 

25 monosodium glutamate (MSG) solution. In this manner, a collection of temperature- 
sensitive mutants was assembled. 

Transformation and Screening of ts Mutants 

In order to isolate mutants that demonstrated a defect in the surface 

30 display of protein, it was necessary to develop a screening process to locate these 
strains. Previous studies that had indicated a typical secretory-dependent process, in 
conjunction with known C-terminal cleavage of translocated proteins, were used to 
elucidate a selection scheme to isolate the desirable mutants. The construct harboring 
the staphylococcal enterotoxin B (SEB) gene fused to the cell wall sorting signal of 

35 staphylococcal protein A (SPA) was used in this assay. This reporter molecule has 
been shown to be properly processed not only by the secretoiy machinery and through 
signal peptidase cleavage of an N-terminal secretion signal, but also to be correctly 
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sorted after secretion to its peptidoglycan substrate. Mutant cells were made 
competent using 0.5 M sucrose washes, and were transformed with the 
staphylococcal/Escherichia coli shuttle vector pOS 1 containing the SEB-SPA 
reporter through electroporation. Transformants were selected by virtue of their 
5 resistance to chloramphenicol as encoded by the plasmid. The cells were then 

screened for properties indicative of a defect in the processing of precursor molecules 
to a fully matured and anchored surface protein. 

Cultured cells were induced at 42 C C, and then pulsed with S-35 
ProMix (80% Met, 20% Cys), in order to label all synthesized proteins. Samples were 

10 precipitated through acid treatment, then digested with lysostaphin (100 fig/ml) and 
subsequently reprecipitated. This was followed by solubilization in hot sodium 
dodecyl sulfate (4%), and an immunoprecipitation with anti-SEB antibodies. 
Samples were run on sodium dodecyl sulfate-polyacrylamide gel electrophoresis and 
finally exposed to phosphorimager evaluation. The resulting banding patterns were 

15 analyzed and quantitated. 

This procedure yielded three anti-SEB reacting species, termed PI , P2 
and M. PI , the largest precursor which migrated to 33 kDa, represents the complete 
gene product encoded by the SEB-SPA construct, with no modifications. P2, the 
second precursor, most likely represents the product after cleavage of the amino- 

20 terminal signal peptide found in SEB, and thus migrates to 32 kDa. The smallest 
species, M, is a lysostaphin-solubilized, maturely anchored peptide that has neither 
the signal peptide nor the remainder of the carboxyl-terminal sorting signal after the 
cleavage. This band migrates at approximately 29 kDa. 

Analysis of these species was conducted through phosphorimager 

25 quantitation, and mutants were selected based upon the proposed phenotype that the 
inhibition in sorting would result in an accumulation of P2 and a reduction in the 
production of M. Through this process, two mutants, SM317 and SM329, were 
earmarked for further analysis due to their elevated ratio of P2/M over wild-type. 

Both mutants finally demonstrated an accumulation of P2 after a 5 min 

30 kinetic analysis, but also clearly showed a decrease in the mutant's ability to degrade 
the species over time as measured by samples chased with cold methionine. These 
results were interpreted to mean that the ability of the mutants to process mature cell 
wall anchored peptides was impaired, quite possibly due to the less efficient activity 
of the sortase-transamidase enzyme. 

35 Protoplasts were made of these mutants in order to cure them of their 

plasmids encoding the reporter construct, and subsequently retransformed with the 
SEB/SPA containing plasmid to once again test for preservation of this phenotype. 
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Upon a favorable result, the mutants were prepared for complementation by a S. 
aureus chromosomal library. 

Generation of an S. aureus Chromosomal Library and Complementation of ts Defect 
5 A staphylococcal library was made through a Sau3 AI digest of the 

chromosomal DNA preparation from 5. aureus strain RN4220. DNA was isolated 
through a phenol-chloroform extraction from lysed cells, and digested for various 
times until the correct partial digest pattern was observed. Fragments greater than 2.5 
kb were inserted into the BamHI cloning site in the multi-cloning sequence (MCS) of 
10 plasmid pC 1 94-MCS. This heterogeneous mixture of plasmids was then transformed 
into competent OS2 cells. Approximately 15,000 clones were harvested. DNA was 
prepared and transformed into competent cells made of both mutants, and 
simultaneously plated at 30°C and 42°C to screen for complementation of the ts 
mutant phenotype. 

15 Through this process, four chromosomal inserts from each mutant were 

found to complement the ts phenotype by conferring growth at 42°C. Due to the 
nature of the mutagenesis, it is at this point necessary to demonstrate definitively that 
the ts defect is somehow linked to the defect in processing. This is done through 
illustration that the plasmids harboring the chromosomal inserts not only complement 

20 the temperature sensitivity, but also relieve the accumulation of P2 at the expense of 
M in these mutants. Therefore, the complemented mutants were screened along with 
the non-complemented versions against wild type OS2. 

Screen for Sorting Defect Complementation 

25 This assay was conducted differently for each of the two mutants. 

SM-31 7 was screened by the insertion of the SEB/SPA fragment aboard the 
replication defective pCL 84 vector that possessed integration capability into the 
chromosome of & aureus cells. The site-specific integration, mediated by the 
integrase gene supplied in trans by pCLl 12, disrupts the lipase gene, which can be 

30 assayed for by the lack of hydrolysis on egg yolk agar plates. Once successfully 

integrated, the RN4220 chromosomal fragments that complement the ts mutation can 
be added to make the cells ready for screening. 

SM329 was assayed by another approach. The pCl 94 plasmid 
harboring the complementing stretch of DNA was fused to an E. coli replicon 

35 pHSG399 that contained the SEB/SPA gene. The shuttle vector thus provided both a 
reporter substrate as well as a complementing activity. 
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The result of this screen demonstrated that over several time periods of 
the pulse-chase, the complementing insert added to the mutant reduced the accretion 
of P2 from the elevated level characteristic of the mutant to the wild type range in 
both mutants, the more dramatically in SM-329. This may be due to the fact that the 
5 substrate for sorting in SM-3 1 7 is found in only one copy per cell, whereas in SM- 
329, the reporter aboard pC194 is present at approximately 15 copies. Nevertheless, 
these results indicate that these mutants are in the sortase-transamidase gene and the 
sequencing of both chromosomal inserts was therefore undertaken. It should be 
pointed out that the complementing activity of each of the respective mutants was not 
10 transferable to the other, neither in terms of temperature sensitivity nor for the 

processing defect. Also, the four complementing clones isolated from each mutant 
seemed to behave in an identical manner, and also seemed to possess very similar 
restriction sites when digested with various specific endonucleases. Therefore, one 
clone was chosen from each mutant for sequencing. 

15 

Sequencing and Characterization of the S. aureus Complementing Determinants 

The chromosomal inserts carrying the sorting defect complementing 
capabilities were sequenced. This work was done completely by automated sequence 
analysis using dideoxyribonucleotides. Sequence data was confirmed by duplicate 

20 analysis of both strands of DNA. Comparison was done to all known nucleotide and 
protein sequence was currently found in the GenBank service. The partial, crude 
sequence of the S. aureus gene is shown in Figure 10 (SEQ ID NO: 28 & 29). The 
partial carboxy-terminal amino acid sequence of the open reading frame generated 
from the gene sequence of Figure 10 (SEQ ID NO: 28 & 29) is shown in Figure 1 1 

25 (SEQ ID NO: 2). 

Several stretches of high homology were found, to both known and 
putative proteins of varying function. The first 364 bases of the SM-3 17 
complementing gene insert been identified as encoding a protein that is a homologue 
of a putative Bacillus peptidase in the GCVT-SPOIIIAA intergenic region (GenBank 

30 Accession No. 1 73 1 048; Y. Kobayashi et al.). The sequence of this putative peptidase 
is shown in Figure 5 (SEQ ID NO: 3) and its hydrophobicity profile is shown in 
Figure 6. The hydrophobicity is calculated according to the method of J. Kyte & R.F. 
Doolittle, "A Simple Method for Displaying the Hydropathic Character of a Protein," 
J. Mol. Biol. 157: 105-132 (1982). To a lesser degree of homology, the protein 

35 encoded by this complementing gene insert is homologous to aminopeptidase P of 
Lactococcus lactis (GenBank Accession No. 1915907; J. Matos). The amino acid 
sequence of this aminopeptidase is shown in Figure 7 (SEQ ID NO: 4). To a still 
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lesser degree of homology, the protein encoded by this complementing gene insert is 
homologous to the proline dipeptidase of Lactobacillus delbrueckii lactis (GenBank 
Accession No. 1 172066; K. Stucky et al., "Cloning and DNA Sequence Analysis of 
pepQ, a Prolidase Gene from Lactobacillus delbrueckii subsp. lactis and Partial 

5 Characterization of Its Product," Mol. Gen. Genet. 247: 494-500 (1995). The amino 
acid sequence of this proline dipeptidase is shown in Figure 8 (SEQ ID NO: 5). 

An additional complementing gene insert, in a vector designated 
pCOMPl, has also been sequenced. The DNA sequence of this complementing gene 
insert is shown in Figure 12 (SEQ ID NO: 30), together with the amino acid sequence 

10 (SEQ ID NO: 3 1) of the protein translated from this DNA sequence, and the 

hydrophobicity profile of the protein translated from the DNA sequence is shown in 
Figure 13. The amino acid sequence of the protein translated from the sequence of 
Figure 12 (SEQ ID NOS: 30 & 3 1) has virtually no homology with the amino acid 
sequence of the protein shown in Figure 1 1 (SEQ ID NO: 2). In particular, the amino 

15 acid sequence of the protein of Figure 12 (SEQ ID NOS: 30 & 31) has a single 
cysteine residue. 

Although Applicants do not intend to be bound by this theory, the 
existence of these two complementing inserts that define different polypeptide 
sequences suggests that the sortase-transamidase enzyme of S. aureus is a 

20 heterooligomer, with two or more different subunits that have different amino acid 
sequences. Mutations in at least two different subunits can give rise to the 
temperature sensitive phenotype and can then be complemented for. Alternatively, the 
synthesis of the peptidoglycan may require additional enzymes. 

Upon completion of the sequencing, and a study of all open reading 

25 frames, candidate genes were selected for further analysis. These genes were 
expressed in the mutants to determine if they complement both the ts and sorting 
defects. Upon success in this capacity, the genes were disrupted in wild-type S. 
aureus to determine their essentiality and possible biological roles. 

30 Materials and Methods 

Mutagenesis of S. aureus Strain OS2 . S, aureus strain OS2 (RN4220 

erm spa-) was mutagenized by treatment with N-methyl-N-nitro-N- 

nitrosoguanidine at 2 mg/ml. Culture OD was measured at 660 nm for viability. 

After 45 min, cultures were spun down at 4,000 rpm for 10 min, and washed with 
35 citrate buffer, pH 5.5. Cells were resuspended in citrate buffer to a concentration of 

5xl0 8 cells/ml. These cultures were serially diluted in TSB (tryptic soy broth) and 
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plated on TSB+ rifampicin (10 jag/ml) agar plates. Mutation frequencies were 
determined to be 5x 1 0 5 mutation/cfii. 

Enrichment and Selection of ts Mutants . A 1 : 100 dilution of cell 
cultures grown overnight at 30°C was added to TSB, and allowed to grow for 2 hr, at 
5 which time penicillin G was added at 5 jig/ml. Culture viability was measured by 
taking OD readings at 660 nm at various times until the concentration dropped to a 
stable point. Cells were washed twice in TSB, and the enrichment was repeated a 
second time. 

After two successive rounds of penicillin selection for growth arrest at 

10 42°C, individual colonies were picked and simultaneously streaked on duplicate TSB 
plates incubated at 30°C and 42°C. Those colonies that had a growth defect at 42°C 
were rechecked at this nonpermissive temperature, and stored at -80°C in a 5% BSA 
5% MSG solution. 

Transformation of Competent Cells . Mutant cells were made 

15 competent by diluting overnight cultures 1 : 1 0 in TSB, and growing to 0.3 OD at 660 
nm. Cells were then spun down at 7,500 rpm for 15 min and resuspended in an equal 
volume of 0.5 M sucrose. After another pelleting, cells were resuspended in 0.5 
volume sucrose, and incubated for 30 min at 4°C. Following another spin, cells were 
brought up in 0.1 volume sucrose. These competent cells were then transformed with 

20 the appropriate plasmids encoding chloramphenicol (10 jig/ml) resistance by 

electroporation at 200 ohms, 25 jxF, and 2.5 kV in 0.2 cm cuvettes. Cells were plated 
on TSB plus chloramphenicol incubated at 30°C. Transformed strains were frozen in 
BSA/MSG at -80°C. 

Pulse-Chase Screen of Mutants . Strains were inoculated in chemically 

25 defined media with chloramphenicol and grown overnight at 30°C. Cultures were 
diluted 1:10 into medium IV (O. Schneewind et al., "Sorting of Protein A to the 
Staphylococcal Cell Wall/' Cell 70: 267-281 (1992)), grown for 3 hr, and then 
induced at 42°C for 20 min. At this time, cultures were pulse labeled with 50 jxCi of 
S-35 ProMix for 5 min, and then terminated with 5% trichloroacetic acid (TCA). 

30 Cells were incubated at 4° for 30 min, centrifuged at 12,500 rpm for 15 min, and the 
supernatants aspirated. After resuspension in acetone, cells were spun again and 
aspirated to dryness. At this time, cells were treated with lysostaphin (100 jig/ml) for 
30 min or until noticeable clearing, and subjected to another TCA/acetone 
precipitation. After lysis of cells by boiling for 10 min with 4% SDS in 0.5m Tris, pH 

35 8.0, proteins were immunoprecipitated with anti-SEB for 1 hr and protein A- 

Sepharose beads.for another 1 hr. Samples were washed three times in RIPA buffer, 
pH 8.0 (300 mM NaCl, 2% Triton X-100, 1% deoxycholate, 0.2% SDS), and protein 
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was solubilized with SDS-urea sample buffer (50 mM Tris-HCl, pH 6.8, 3% SDS, 
5% 2-mercaptoethanol, 3.5 M urea), with boiling for 10 min. Samples were run on 
SDS-polyacrylamide gel electrophoresis gels and exposed to aphosphorimager screen 
overnight. Quantitations were done on ImageQuant software. 
5 DNA Sequencing . DNA was sequenced on a Perkin-Elmer automated 

sequencer after PCR using dye-terminating ready reaction mixed with SS-Taq 
polymerase. GenBank analysis was done using BLAST software to search the 
database. 

10 ADVANTAGES OF THE PRESENT INVENTION 

In isolating and characterizing the gene for the S. aureus sortase- 
transamidase enzyme, we have determined the existence of a new site for antibiotic 
action that can be used to screen new antibiotics active against Gram-positive 
pathogens, such as Staphylococcus, Actinomyces, Mycobacterium, Streptococcus, 

15 Bacillus, and other medically important Gram-positive pathogens increasingly 
resistant to conventional antibiotics. The availability of substantially purified S. 
aureus sortase-transamidase enzyme provides a method of screening compounds for 
inhibition of the enzyme. 

The purified sortase-transamidase enzyme of the present invention 

20 also yields a method of surface display of peptides and proteins that has advantages 
over phage display, as well as providing methods for producing vaccines against a 
large variety of antigens that can be covalently bound to the surfaces of Gram- 
positive bacteria. 

Although the present invention has been described with considerable 
25 detail, with reference to certain preferred versions thereof, other versions and 

embodiments are possible. Therefore, the scope of the invention is determined by the 
following claims. 
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We claim: 



1 . A substantially purified sortase-transamidase enzyme from a 

5 Gram-positive bacterium, the enzyme catalyzing a reaction that covalently cross- 
links the carboxyl terminus of a protein having a sorting signal to the peptidoglycan of 
a Gram-positive bacterium, the sorting signal having a motif of LPX3X4G therein, 
wherein sorting occurs by cleavage between the fourth and fifth residues of the 
LPX3X4G motif. 

10 

2. The substantially purified sortase-transamidase enzyme of claim 1 
wherein the Gram-positive bacterium is a species selected from the group consisting 
of Staphylococcus aureus, S. sobrinus, Enterococcus faecalis, Streptococcus 
pyogenes, and Listeria monocytogenes, 

15 

3. The substantially purified sortase-transamidase enzyme of claim 2 
wherein the Gram-positive bacterium is Staphylococcus aureus. 

4. The substantially purified sortase-transamidase enzyme of claim 1 
20 wherein one subunit of the enzyme has a molecular weight of about 41 ,000 daltons. 

5. The substantially purified sortase-transamidase enzyme of claim 4 
wherein the sorting signal further comprises:(2) a substantially hydrophobic domain of 
at least 31 amino acids carboxyl to the motif; and (3) a charged tail region with at least 

25 two positively charged residues carboxyl to the substantially hydrophobic domain, at 
least one of the two positively charged residues being arginine, the two positively 
charged residues being located at residues 31-33 from the motif, wherein X 3 is any of 
the twenty naturally-occurring L-amino acids and X4 is selected from the group 
consisting of alanine, serine, and threonine. 

30 

6. The enzyme of claim 1 wherein the enzyme includes therein an 
amino acid sequence selected from the group consisting of :(1) D-P-K-L-K-E-I- 
Y-Q-I-V-L-E-S-Q-M-K-A-I-N-E-I-R-P-G-M-T-G-A-E-A-D-A-I-S- 
R-N-Y-L-E-S-K-G-Y-G-K-E-F-G-H-S-L-G-H-G-I-G-L-E-I-H-E-G- 

35 P-M-L-A-R-T-I-Q-D-K-L-Q-V-N-N-C-V-T-V-E-P-G-V-Y-I-E-G-L- 
G-I-R-I-E-D-D-I-L-I-T-E-N-G-C-Q-V-F-T-K-C-T-K-D-L-I-V-L-T 
(SEQ ID NO: 2); (2) M-V-K-V-T-D-Y-S-N-S-K-L-G-K-E-I-A-P-E-V-L- 
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S-V-I-A-S-I-A-T-S-E-V-E-G-I-T-G-H-F~A'E-L-K-E-T-N-L-E-K-V- 

S-R-K~N-L-S-R-D-L-K-I-E-S-K-E-G~I-Y-I-D-V-Y-C-A~L-K-H-G- 

V-N-I-S-K-T-A-N-K-I-<^ 

N-I-H-I-T-Q-I-V-I-E-K (SEQ ID NO: 31); and (3) sequences incorporating one 
5 or more conservative amino acid substitutions in SEQ ID NO:2 or SEQ ID NO: 3 1 , 
wherein the conservative amino acid substitutions are any of the following: (1) any of 
isoleucine, leucine, and valine for any other of these amino acids; (2) aspartic acid for 
glutamic acid and vice versa; (3) glutamine for asparagine and vice versa; and (4) 
serine for threonine and vice versa. 

10 

7. The enzyme of claim 6 wherein the amino acid sequence is D-P- 
K-L-K-E-I-Y-Q-I-V-L-E-S-Q-M-K-^ 
A-D-A-I-S-R-N-Y-L-E-S-K-^Y-G^ 

E-I-H-E-G-P-M-L-A-R-T-I-Q-I>-K-L-Q-V-N-N-C-V-T-V-E-P-G-V- 
15 Y-I-E-G-L-G-I-R-I-E-D~D-I-L~I-T-E-N-G-C-Q-V-F-T-K-C-T-K-D- 
L-I-V-L-T (SEQ ID NO: 2). 

8. The enzyme of claim 6 wherein the amino acid sequence is M-V- 
K-V-T-D-Y-S-N-S-K-L-G-K-E-I-A-P-E-V-L-S-V-I-A-S-I-A-T-S-E- 

20 V-E-G-I-T-G-H-F-A-E-L-K-E-T-N-L-E-K-V-S-R-K-N-L-S-R-D-L- 
K-I-E-S-K-E-G-I-Y-I-D-V-Y-C-A-L-K-H-G-V-N-I-S-K-T-A-N-K-I- 
Q-T-S-I-F-N-S-I-S-N-M-T-A-I-E-P-K-Q-I-N-I-H-I-T-Q-I-V-I-E-K 
(SEQ ID NO: 31). 

25 9. A nucleic acid sequence encoding the enzyme of claim 6. 

10. A nucleic acid sequence encoding the enzyme of claim 7. 

1 1 . A nucleic acid sequence encoding the enzyme of claim 8. 

30 

12. A nucleic acid sequence encoding a substantially purified sortase- 
transamidase enzyme from a Gram-positive bacterium, the enzyme having a subunit 
with a molecular weight of about 41,000 daltons and catalyzing a reaction that 
covalently cross-links the carboxyl terminus of a protein having a sorting signal to the 

35 peptidoglycan of a Gram-positive bacterium, the sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 31 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
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residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L— amino acids and X4 is selected from the group consisting of alanine, 
5 serine, and threonine, and wherein sorting occurs by cleavage between the fourth and 
fifth residues of the LPX3X4G motif, wherein the nucleic acid sequence includes 
therein a sequence selected from the group consisting of: (1) 

GATCCTAAACTGAAAGAAATATATCAAATAGTACTTGAATCTCAAATGAA 
AGCAATTAATGAGATTAGACCTGGCATGACTGGTGCAGAAGCTGATGCCA 

10 TTTCAAGAAACTATTTAGAGTCAAAAGGGTATGGAAAAGAATTTGGACAT 
TCACTAGGACATGGTATTGGTTTAGAAATCCATGAAGGGCCAATGCTGGC 
TCGTACGATACAAGATAAACTTCAAGTTAACAACTGTGTTACAGTAGAAC 
CTGGTGTTTATATAGAAGGTTTGGGCGGTATAAGAATAGAAGATGATATT 
TTAATTACAGAAAATGGTTGTCAAGTCTTTACTAAATGCACAAAAGACCTT 

15 ATAGTTTTAACATAA (SEQ ID NO: 28); (2) 

ATGGTCAAAGTAACTGATTATTCAAATTCAAAATTAGGTAAAGTAGAAAT 
AGCGCCAGAAGTGCTATCTGTTATTGCAAGTATAGCTACTTCGGAAGTCG 
AAGGCATCACTGGCCATTTTGCTGAATTAAAAGAAACAAATTTAGAAA^ 
GTTAGTCGTAAAAATTTAAGCCGTGATTTAAAAATCGAGAGTAAAGAAGA 

20 TGGCATATATATAGATGTATATTGTGCATTAAAACATGGTAATATTTCAAA 
AACTGCAAACAAAATTCAAACGTCAATTTTTAATTCAAm 
AGCGATAGAACCTAAGCAAATTAATATTCACATTACACAAATCGTTATTG 
AAAAGTAA (SEQ ID NO: 30); or (3) a sequence complementary to SEQ ID NO: 28 
or SEQ ID NO: 30. 

25 

13. A nucleic acid sequence encoding a substantially purified sortase- 
transamidase enzyme from a Gram-positive bacterium, the enzyme having a subunit 
with a molecular weight of about 41 ,000 daltons and catalyzing a reaction that 
covalently cross-links the carboxyl terminus of a protein having a sorting signal to the 

30 peptidoglycan of a Gram-positive bacterium, the sorting signal having (1 ) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 31 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 

35 located at residues 3 1 -33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine, and wherein sorting occurs by cleavage between the fourth and 
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fifth residues of the LPX3X4G motif, wherein the nucleic acid sequence hybridizes 
with a sequence selected from the group consisting of: (1) 

GATCCTAAACTGAAAGAAATATATCAAATAGTACTTGAATCTCAAATGAA 
AGCAATTAATGAGATTAGACCTGGCATGACTGGTGCAGAAGCTGATGCCA 
5 TTTCAAGAAACTATTTAGAGTCAAAAGGGTATGGAAAAGAATTT 

TCACTAGGACATGGTATTGGTTTAGAAATCCATGAAGGGCCAATGCTGGC 
TCGTACGATACAAGATAAACTTCAAGTTAACAACTGTGTTACAGTAGAAC 
CTGGTGTTTATATAGAAGGTTTGGGCGGTATAAGAATAGAAGATGATATT 
TTAATTACAGAAAATGGTTGTCAAGTCTTTACTAAATGCACAAAAGACC 

10 ATAGTTTTAACATAA (SEQ ID NO: 28); (2) 

ATGGTCAAAGTAACTGATTATTCAAATTCAAAATTAGGTAAAGTAGAAAT 
AGCGCCAGAAGTGCTATCTGTTATTGCAAGTATAGCTACTTCGGAAGTCG 
AAGGCATCACTGGCCATTTTGCTGAATTAAAAGAAACAAATTTAGA^ 
GTTAGTCGTAAAAATTTAAGCCGTGATTTAAAAATCGAGAGTAAAGAAGA 

1 5 TGGC ATATATATAG ATGTATATTGTGC ATT AAAAC ATGGTAATATTTC AAA 
AACTGCAAACAAAATTCAAACGTCAATTTTT 

AGCGATAGAACCTAAGCAAATTAATATTCACATTACACAAATCGTTATTG 
AAAAGTAA (SEQ ID NO: 30) or (3) a sequence complementary to SEQ ID NO: 28 
or SEQ ID NO: 30, with no greater than about a 15% mismatch under stringent 
20 conditions. 

14. The nucleic acid sequence of claim 13 wherein the mismatch is no 
greater than about 5%. 

25 15. The nucleic acid sequence of claim 14 wherein the mismatch is no 

greater than about 2%. 

16. A vector comprising the nucleic acid sequence of claim 9 
operatively linked to at least one control sequence that controls the expression or 

30 regulation of the nucleic acid sequence. 

17. A vector comprising the nucleic acid sequence of claim 10 
operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 

35 
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18. A vector comprising the nucleic acid sequence of claim 1 1 
operatively linked to at least one control sequence that controls the expression or - 
regulation of the nucleic acid sequence. 

5 19. A vector comprising the nucleic acid sequence of claim 12 

operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 

20. A vector comprising the nucleic acid sequence of claim 13 

10 operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 

2 1 . A host cell transfected with the vector of claim 1 6. 
15 22. A host cell transfected with the vector of claim 17. 

23. A host cell transfected with the vector of claim 1 8. 

24. A host cell transfected with the vector of claim 19. 

25. A host cell transfected with the vector of claim 20. 



20 



26. A method for producing a substantially purified sortase- 
transamidase enzyme comprising the steps of: 
25 (a) culturing the host cell of claim 21 under conditions in which the 

host cell expresses the encoded sortase-transamidase enzyme; and 

(b) purifying the expressed enzyme to produce substantially purified 
sortase-transamidase enzyme. 

30 27. A method for producing a substantially purified sortase- 

transamidase enzyme comprising the steps of: 

(a) culturing the host cell of claim 22 under conditions in which the 
host cell expresses the encoded sortase-transamidase enzyme; and 

(b) purifying the expressed enzyme to produce substantially purified 
35 sortase-transamidase enzyme. 
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28. A method for producing a substantially purified sortase- 
transamidase enzyme comprising the steps of: 

(a) culturing the host cell of claim 23 under conditions in which the 
host cell expresses the encoded sortase-transamidase enzyme; and 

5 (b) purifying the expressed enzyme to produce substantially purified 

sortase-transamidase enzyme. 

29. A method for producing a substantially purified sortase- 
transamidase enzyme comprising the steps of: 

1 0 (a) culturing the host cell of claim 24 under conditions in which the 

host cell expresses the encoded sortase-transamidase enzyme; and 

(b) purifying the expressed enzyme to produce substantially purified 
sortase-transamidase enzyme. 

15 30. A method for producing a substantially purified sortase- 

transamidase enzyme comprising the steps of: 

(a) culturing the host cell of claim 25 under conditions in which the 
host cell expresses the encoded sortase-transamidase enzyme; and 

(b) purifying the expressed enzyme to produce substantially purified 
20 sortase-transamidase enzyme. 

3 1 . Substantially purified sortase-transamidase enzyme produced by 
the process of claim 26. 

25 32. Substantially purified sortase-transamidase enzyme produced by 

the process of claim 27. 

33. Substantially purified sortase-transamidase enzyme produced by 
the process of claim 28. 

30 

34. Substantially purified sortase-transamidase enzyme produced by 
the process of claim 29. 

35. Substantially purified sortase-transamidase enzyme produced by 
35 the process of claim 30. 
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36. A method for screening a compound for anti-sortase-transamidase 
activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 1; 

5 (b) performing an assay for sortase-transamidase in the presence and in 

the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

10 

37. A method for screening a compound for anti-sortase-transamidase 
activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 3; 

15 (b) performing an assay for sortase-transamidase in the presence and in 

the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

20 

38. A method for screening a compound for anti-sortase-transamidase 
activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 31; 

25 (b) performing an assay for sortase-transamidase in the presence and in 

the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

30 

39. A method for screening a compound for anti-sortase-transamidase 
activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 32; 

35 (b) performing an assay for sortase-transamidase in the presence and in 

the absence of the compound; and 
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(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

5 40. A method for screening a compound for anti— sortase- transamidase 

activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 33; 

(b) performing an assay for sortase-transamidase in the presence and in 
10 the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

15 4 1 . A method for screening a compound for anti-sortase-transamidase 

activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 34; 

(b) performing an assay for sortase-transamidase in the presence and in 
20 the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

25 42. A method for screening a compound for anti-sortase-transamidase 

activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 35; 

(b) performing an assay for sortase-transamidase in the presence and in 
30 the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 



WO 99/09145 



PCT/US98/16229 



-49- 

43. A method for screening a compound for anti-sortase-transamidase 
activity comprising the steps of: 

(a) providing an active fraction of sortase-transamidase enzyme from 
a Gram-positive bacterium; 
5 (b) performing an assay for sortase-transamidase in the presence and in 

the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

10 

44. The method of claim 43 wherein the active fraction of sortase- 
transamidase enzyme is a particulate fraction from Staphylococcus aureus. 



45 . The method of claim 43 wherein the assay for sortase- 

15 transamidase enzyme is performed by monitoring the capture of a soluble peptide that 
is a substrate for the enzyme by its interaction with an affinity resin. 

46. The method of claim 45 wherein the soluble peptide includes a 
sequence of at least six histidine residues and the affinity resin contains nickel. 

20 

47. The method of claim 45 wherein the soluble peptide includes the 
active site of glutathione S-transferase and the affinity resin contains glutathione. 

48. The method of claim 45 wherein the soluble peptide includes the 
25 active site of streptavidin and the affinity resin contains biotin. 

49. The method of claim 45 wherein the soluble peptide includes the 
active site of maltose binding protein and the affinity resin contains amylose. 

30 50. An antibody specifically binding the substantially purified sortase- 

transamidase enzyme of claim 1. 



5 1 . An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 3. 

35 

52. An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 3 1 . 
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53. An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 32. 

5 54. An antibody specifically binding the substantially purified sortase- 

transamidase enzyme of claim 33. 

55. An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 34. 

10 

56. An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 35. 

57. A protein molecule comprising the substantially purified sortase- 
15 transamidase enzyme of claim 1 extended at its carboxyl-terminus with a sufficient 

number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column through the histidine residues added at the carboxyl- 
terminus. 

20 58. A protein molecule comprising the substantially purified sortase- 

transamidase enzyme of claim 3 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column through the histidine residues added at the carboxyl- 
terminus. 

25 

59. A protein molecule comprising the substantially purified sortase- 
transamidase enzyme of claim 3 1 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 

30 

60. A protein molecule comprising the substantially purified sortase- 
transamidase enzyme of claim 32 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 

35 
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61. A protein molecule comprising the substantially purified sortase- 
transamidase enzyme of claim 33 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 

5 

62. A protein molecule comprising the substantially purified sortase- 
transamidase enzyme of claim 34 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 

10 

63. A protein molecule comprising the substantially purified sortase- 
transamidase enzyme of claim 35 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 

15 

64. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at itscarboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

20 substantially hydrophobic domain of at least 31 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
• substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 

25 and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transamidase of claim 1 ; and (iii) a 
Gram-positive bacterium having a peptidoglycan to which the sortase-transamidase 
can link the polypeptide; and 

30 (c) allowing the sortase-transamidase to catalyze a reaction that 

cleaves the polypeptide within the LPX3X4 motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 



35 
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65. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

5 substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
10 and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transarnidase of claim 3; and (iii) a 
Gram-positive bacterium having a peptidoglycan to which the sortase-transamidase 
can link the polypeptide; and 

15 (c) allowing the sortase-transamidase to catalyze a reaction that 

cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 

20 

66. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

25 substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 

30 and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 31 ; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide; and 

35 (c) allowing the sortase-transamidase to catalyze a reaction that 

cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
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peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 

67. A method for displaying a polypeptide on the surface of a Gram- 
5 positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 
substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 

10 substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 

15 polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 32; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide; and 

(c) allowing the sortase-transamidase to catalyze a reaction that 
cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 

20 cross-links the amino-terminal portion of the cleaved polypeptide to the 

peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 

68. A method for displaying a polypeptide on the surface of a Gram- 
25 positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 
substantially hydrophobic domain of at least 31 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 

30 substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 

35 polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 33; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide; and 



WO 99/09145 PCT/US98/16229 

-54- 

(c) allowing the sortase-transamidase to catalyze a reaction that 
cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 
5 bacterium. 

69. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
10 terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
15 from the motif, wherein X 3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 34; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 

20 transamidase can link the polypeptide; and 

(c) allowing the sortase-transamidase to catalyze a reaction that 
cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 

25 bacterium. 

70. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
30 terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 3 1-33 
35 from the motif, wherein X 3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 
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(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 35; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide; and 

(c) allowing the sortase-transamidase to catalyze a reaction that 
cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the ammo-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 



71 . A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 

15 carboxyl-terminal sorting signal, the chimeric protein including the polypeptide to be 
displayed, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 
substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 

20 being arginine, the two positively charged residues being located at residues 3 1-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) growing the bacterium into which the nucleic acid segment has 
been cloned to express the cloned chimeric protein to generate a chimeric protein 

25 including therein a carboxyl-terminal sorting signal; and 

(c) binding the polypeptide covalently to the cell wall by the enzymatic 
action of a sortase-transamidase expressed by the Gram-positive bacterium involving 
cleavage of the chimeric protein within the LPX3X4G motif so that the polypeptide is 
displayed on the surface of the Gram-positive bacterium in such a way that the 

30 polypeptide is accessible to a ligand. 

72. A polypeptide displayed on the surface of a Gram-positive 
bacterium by covalent linkage of an amino-acid sequence of LPX3X4 derived from 
cleavage of an LPX3X4G motif, wherein X3 is any of the twenty naturally-occurring 

35 L-amino acids and X4 is selected from the group consisting of alanine, serine, and 
threonine, the polypeptide being displayed on the surface of the Gram-positive 
bacterium in such a way that the polypeptide is accessible to a ligand. 



WO 99/09145 



PCT/US98/16229 



-56- 

73. A covalent complex comprising: 

(a) the polypeptide of claim 72; and 

(b) an antigen or hapten covalently cross-linked to the polypeptide. 

5 

74. The covalent complex of claim 73 wherein an antigen is covalently 
cross-linked to the polypeptide. 

75. The covalent complex of claim 73 wherein a hapten is covalently 
10 cross-linked to the peptide. 

76. A method for vaccination of an animal comprising the step of 
immunizing the animal with the displayed polypeptide of claim 72 to generate an 
immune response against the displayed polypeptide. 

15 

77. A method for vaccination of an animal comprising the step of 
immunizing the animal with the covalent complex of claim 73 to generate an immune 
response against the antigen or hapten of the covalent complex. 

20 78. A method for screening for expression of a cloned polypeptide 

comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 31 amino acids 

25 carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 3 1-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 

30 serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; the substantially purified sortase-transamidase enzyme of claim 1; and (iii) a 
Gram-positive bacterium having a peptidoglycan to which the sortase-transamidase 
can link the polypeptide through the sorting signal; 

35 (c) binding the chimeric protein covalently to the cell wall by the 

enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
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that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
5 partner. 

79. A method for screening for expression of a cloned polypeptide 
comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
10 sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 

LPX3X4G therein; (2) a substantially hydrophobic domain of at least 31 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
15 located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) the substantially purified sortase-transamidase enzyme of claim 3; and 

20 (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 

25 that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

30 

80. A method for screening for expression of a cloned polypeptide 
comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
35 LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
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positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine; 

5 (b) forming a reaction mixture including: (i) the expressed chimeric 

protein; (ii) the substantially purified sortase-transamidase enzyme of claim 3 1 ; and 
(iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
10 enzymatic action of a sortase-transamidase expressed by the Gram-positive 

bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
15 partner to screen the chimeric protein for reactivity with the labeled specific binding 

partner. 

81 . A method for screening for expression of a cloned polypeptide 
comprising the steps of: 

20 (a) expressing a cloned polypeptide as a chimeric protein having a 

sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 31 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 

25 positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
30 protein; (ii) the substantially purified sortase-transamidase enzyme of claim 32; and 

(iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
enzymatic action of a sortase-transamidase expressed by the Gram-positive 

35 bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 
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(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

5 82. A method for screening for expression of a cloned polypeptide 

comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 31 amino acids 

10 carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 3 1-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 

1 5 serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) the substantially purified sortase-transamidase enzyme of claim 33; and 
(iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

20 (c) binding the chimeric protein covalently to the cell wall by the 

enzymatic action of a sortase-transamidase expressed by the Gram— positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

25 (d) reacting the displayed polypeptide with a labeled specific binding 

partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

83. A method for screening for expression of a cloned polypeptide 
30 comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 31 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
35 residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 
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occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) the substantially purified sortase-transamidase enzyme of claim 34; and 

5 (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 

10 that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

15 

84. A method for screening for expression of a cloned polypeptide 
comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 

20 LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 

25 occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) the substantially purified sortase-transamidase enzyme of claim 35; and 
(iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 

30 transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
enrymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the polypeptide is displayed on the surface of the Gram-positive bacterium in 

35 such a way that the polypeptide is accessible to a ligand; and 
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(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

5 85. A method for screening for expression of a cloned polypeptide 

comprising the steps of: 

(a) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 
carboxyl-terminal sorting signal, the chimeric protein including the polypeptide 

10 whose expression is to be screened, the sorting signal having: (1) a motif of LPX3X4G 
therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids carboxyl to 
the motif; and (3) a charged tail region with at least two positively charged residues 
carboxyl to the substantially hydrophobic domain, at least one of the two positively 
charged residues being arginine, the two positively charged residues being located at 

1 5 residues 3 1 -33 from the motif, wherein X3 is any of the twenty naturally-occurring 
L-amino acids and X4 is selected from the group consisting of alanine, serine, and 
threonine; 

(b) growing the bacterium into which the nucleic acid segment has 
been cloned to express the cloned chimeric protein to generate a chimeric protein 

20 including therein a carboxyl-terminal sorting signal; 

(c) binding the polypeptide covalently to the cell wall by the enzymatic 
action of a sortase-transamidase expressed by the Gram-positive bacterium involving 
cleavage of the chimeric protein within the LPX3X4G motif so that the polypeptide is 
displayed on the surface of the Gram-positive bacterium in such a way that the 

25 polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

30 86. A method for the diagnosis or treatment of a bacterial infection 

caused by a Gram-positive bacterium comprising the steps of: 

(a) conjugating an antibiotic or a detection reagent to a protein 
including therein a carboxyl-terminal sorting signal to produce a conjugate, the 
carboxyl-terminal sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

35 substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
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being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; and 
(b) introducing the conjugate to an organism infected with a Gram- 
5 positive bacterium in order to cause the conjugate to be sorted and covalently cross- 
linked to the cell walls of the bacterium in order to treat or diagnose the infection. 



87. The method of claim 86 wherein an antibiotic is conjugated to the 

protein. 

10 

88. The method of claim 87 wherein the antibiotic is selected from the 
group consisting of a penicillin, ampicillin, vancomycin, gentamicin, streptomycin, a 
cephalosporin, amikacin, kanamycin, neomycin, paromomycin, tobramycin, 
ciprofloxacin, clindamycin, rifampin, chloramphenicol, norfloxacin, and a derivative 

1 5 of these antibiotics. 



89. The method of claim 86 wherein a detection reagent is conjugated 

to the protein. 



20 90. A conjugate comprising an antibiotic or a detection reagent 

covalently conjugated to a protein including therein a carboxyl-terminal sorting signal 
to produce a conjugate, the carboxyl-terminal sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 

25 residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 3 1-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine. 



30 



9 1 . The conjugate of claim 90 wherein an antibiotic is conjugated to 

the protein. 



35 



92. The conjugate of claim 91 wherein the antibiotic is selected from 
the group consisting of a penicillin, ampicillin, vancomycin, gentamicin, 
streptomycin, a cephalosporin, amikacin, kanamycin, neomycin, paromomycin, 
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tobramycin, ciprofloxacin, clindamycin, rifampin, chloramphenicol, norfloxacin, and 
a derivative of these antibiotics. 

93. The conjugate of claim 90 wherein a detection reagent is 
5 conjugated to the protein. 

94. A composition comprising: 

(a) the conjugate of claim 90; and 

(b) a pharmaceutically acceptable carrier. 

10 

95. A substantially purified protein having at least about 50% match 
with best alignment, in at least one subunit of the protein, with the amino acid 
sequences of at least one of the putative Bacillus peptidase (SEQ ID NO: 3), the 
aminopeptidase P of Lactococcus lactis (SEQ ID NO: 4), or the proline dipeptidase of 

15 Lactobacillus delbrueckii lactis (SEQ ID NO: 5) and having sortase-transamidase 
activity. 

96. The substantially purified protein of claim 95 wherein the match 
with best alignment with the amino acid sequences of at least one of the putative 

20 Bacillus peptidase (SEQ ID NO: 3), the aminopeptidase P of Lactococcus lactis (SEQ 
ID NO: 4), or the proline dipeptidase of Lactobacillus delbrueckii lactis (SEQ ID NO: 
5) is at least about 60%. 

97. The substantially purified protein of claim 96 wherein the match 
25 with best alignment with the amino acid sequences of at least one of the putative 

Bacillus peptidase (SEQ ID NO: 3), the aminopeptidase P of Lactococcus lactis (SEQ 
ID NO: 4), or the proline dipeptidase of Lactobacillus delbrueckii lactis (SEQ ID NO: 
5) is at least about 70%. 

30 98. A substantially purified protein having sortase-transamidase 

activity and a hydrophobicity profile of at least one subunit of the protein, that, 
determined as the mean absolute value of the hydrophobicity difference per residue, 
differs from the hydrophobicity profile of a putative Bacillus peptidase (SEQ ID NO: 
3) by no more than about 2 units on the hydrophobicity scale. 



35 
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99. The substantially purified protein of claim 98 wherein the 
hydrophobicity profile differs from the hydrophobicity profile of the putative Bacillus 
peptidase (SEQ ID NO: 3) by no more than about 1 unit. 

5 100. The substantially purified protein of claim 99 wherein the 

hydrophobicity profile differs from the hydrophobicity profile of the putative Bacillus 
peptidase (SEQ ID NO: 3) by no more than about 0.5 unit. 

101 . A nucleic acid sequence encoding the substantially purified 
1 0 protein of claim 95 . 

102. A nucleic acid sequence encoding the substantially purified 
protein of claim 98. 

15 103. A vector comprising the nucleic acid sequence of claim 101 

operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 

104. A vector comprising the nucleic acid sequence of claim 102 
20 operatively linked to at least one control sequence that controls the expression or 

regulation of the nucleic acid sequence. 

105. A host cell transfected with the vector of claim 103. 
25 106. A host cell transfected with the vector of claim 1 04. 

107. A method for producing a substantially purified protein having 
sortase-transamidase activity comprising the steps of: 

(a) culturing the host cell of claim 105 under conditions in which the 
30 host cell expresses the protein having sortase-transamidase activity; and 

(b) purifying the expressed protein to produce substantially purified 
protein having sortase-transamidase activity. 

108. A method for producing a substantially purified protein having 
35 sortase-transamidase activity comprising the steps of: 

(a) culturing the host cell of claim 106 under conditions in which the 
host cell expresses the protein having sortase-transamidase activity; and 
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(b) purifying the expressed protein to produce substantially purified 
protein having sortase-transamidase activity. 

109. A substantially purified protein having sortase-transamidase 
5 activity and a hydrophobicity profile of at least one subunit of the protein, that, 
determined as the mean absolute value of the hydrophobicity difference per residue, 
differs from the hydrophobicity profile of the sequence of SEQ ID NO: 31 by no more 
than about 2 units on the hydrophobicity scale. 

10 110. The substantially purified protein of claim 109 wherein the 

hydrophobicity profile differs from the hydrophobicity profile of the sequence of SEQ 
ID NO: 3 1 by no more than about 1 unit. 



111. The substantially purified protein of claim 1 10 wherein the 
1 5 hydrophobicity profile differs from the hydrophobicity profile of the sequence of SEQ 
ID NO: 31 by no more than about 0.5 unit. 



1 12. A nucleic acid sequence encoding the substantially purified 
protein of claim 109. 

20 

1 13. A vector comprising the nucleic acid sequence of claim 1 12 
operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 



25 1 1 4. A host cell transfected with the vector of claim 113. 

1 15. A method for producing a substantially purified protein having 
sortase-transamidase activity comprising the steps of: 

(a) culturing the host cell of claim 1 14 under conditions in which the 
30 host cell expresses the protein having sortase-transamidase activity; and 

(b) purifying the expressed protein to produce substantially purified 
protein having sortase-transamidase activity. 
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SEQUENCE LISTING 

<110> Schneewind, Olaf 
5 Ton-That, Hung 

Mazmanian, Sarkis 

<120> IDENTIFICATION OF SORTASE GENE 

10 

<130> 30435. 46WO01 

<150> 60/055,662 
<151> 1997-08-14 

15 

<160> 34 

<170> FastSEQ for Windows Version 3.0 

20 <210> 1 

<211> 5 
<212> PRT 

<213> Artificial Sequence 

25 <220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 1 
Leu Pro Xaa Thr Gly 
30 1 5 







<210> 


2 




























<211> 


121 




























<212> 


PRT 
























35 




<213> 


Staphylococcus Aureus 


















<400> 


2 


























Asp 


Pro Lys 


Leu 


Lys 


Glu 


He 


Tyr 


Gin 


lie 


Val 


Leu 


Glu 


Ser Gin 


Met 




1 






5 










10 








15 




40 


Lys 


Ala lie 


Asn 


Glu 


He 


Arg 


Pro 


Gly 


Met 


Thr 


Gly 


Ala 


Glu Ala 


Asp 








20 










25 










30 




Ala 


lie Ser 


Arg 


Asn 


Tyr 


Leu 


Glu 


Ser 


Lys 


Gly 


Tyr 


Gly 


Lys Glu 


Phe 






35 










40 










45 








Gly His Ser 


Leu 


Gly 


His 


Gly 


He 


Gly 


Leu 


Glu 


He 


His 


Glu Gly 


Pro 


45 




50 








55 










60 








Met 


Leu Ala 


Arg 


Thr 


He 


Gin 


Asp 


Lys 


Leu 


Gin 


Val 


Asn 


Asn Cys 


Val 




65 








70 










75 






60 




Thr 


Val Glu 


Pro 


Gly 


Val 


Tyr 


He 


Glu 


Gly 


Leu 


Gly 


Gly 


He Arg 


He 










85 










90 








95 




50 


Glu 


Asp Asp 


He 


Leu 


He 


Thr 


Glu 


Asn 


Gly 


Cys 


Gin 


Val 


Phe Thr 


Lys 








100 










105 










110 






Cys 


Thr Lys 


Asp 


Leu 


He 


Val 


Leu 


Thr 


















115 










120 
















55 




<210> 


3 




























<211> 


353 




























<212> 


PRT 




























<213> 


Bacillus Sp. 




















60 




<400> 


3 


























Met 


Lys Leu 


Glu 


Lys 


Leu 


Arg 


Asn 


Leu 


Phe 


Gly 


Gin 


Leu 


Gly He 


Asp 




1 






, 5 










10 








15 






Gly Met Leu 


He 


Thr 


Ser 


Asn 


Thr 


Asn 


Val 


Arg 


Val 


Met 


Thr Gly. 


Phe 








20 










25 










30 




65 


Thr 


Gly Ser 


Ala Gly Leu 


Ala 


Val 


He 


Ser 


Gly 


Asp 


Lys 


Ala Ala 


Phe 






35 










40 










45 








lie 


Thr Asp 


Phe Arg Tyr 


Thr 


Glu 


Gin 


Ala 


Lys 


Val 


Gin 


Val Lys 


Gly 






50 








55 










60 








Phe 


Glu He 


He 


Glu His 


Gly 


Gly 


Ser 


Leu 


He 


Gin 


Thr 


Thr Ala 


Asp 
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10 



15 



20 



25 



30 



35 



65 

Thr Val 

Met Thr 

Glu Leu 

Ser Ser 
130 
Asp Ala 
145 

He Ala 

Asp Ser 

Leu Pro 

Val Thr 
210 
Thr Arg 
225 

Tyr Gin 

Pro Gly 

Ala Ala 

Leu Gly 
290 
Ala He 
305 

He Pro 
Glu Asn 
Leu 



Glu Ser Phe 
85 

Tyr Gly Thr 

100 
Val Pro Val 
115 

Glu Glu He 

Phe Arg His 

Val Ala Asn 
165 

Ser Ser Phe 

180 
His Gly Val 
195 

Leu Asp Phe 

Thr Val Ala 

Val Val Phe 
245 

Met Thr Gly 

260 
Lys Gly Tyr 
27S 

Met Glu Val 

Leu Glu Pro 

Glu Thr Gly 
325 

Gly Asn Arg 
340 



70 

Gly He Lys 

Tyr Ala Ser 

Ala Glu Ser 
120 

Lys lie Leu 

135 
He Leu Thr 
150 

Glu Leu Glu 

Asp Met He 

Ala Ser Asp 
200 

Gly Ala Tyr 

215 
Val Gly Gin 
230 

Asp Ala Gin 

Lys Glu Ala 

Gly Asp Tyr 
280 

His Glu Ser 

295 
Gly Met Val 
310 

Gly Val Arg 
Thr He Thr 



Arg Leu 

90 
Tyr Ser 
105 

Val Glu 

Glu Glu 

Phe Met 

Phe Tyr 
170 
Val Ala 
185 

Lys Leu 

Tyr Lys 

Pro Ser 

Ala Leu 
250 
Asp Ala 
265 

Phe Gly 

Pro Gly 

Val Thr 

He Glu 
330 
His Ser 
345 



75 

Gly Phe 

Ala Val 

Lys Leu 

Ala Ala 
140 
Lys Pro 
155 

Met Arg 

Ser Gly 

He Glu 

Gly Tyr 
220 
Asp Gin 
235 

Gly Val 

Leu Thr 

His Ser 

Leu Ser 
300 
Val Glu 
315 

Asp Asp 
Pro Lys 



Glu Gin 

He Ser 
110 
Arg Leu 
125 

Lys He 

Gly He 

Ser Gin 

Leu Arg 
190 
Ser Gly 
205 

Cys Ser 

Leu Lys 

Ala His 

Arg Asp 
270 
Thr Gly 
285 

Val Arg 

Pro Gly 

He Val 

Glu Leu 
350 



80 
Asn Ser 
95 

Asp Ala 

lie Lys 

Ala Asp 

Ser Glu 
160 
Gly Ala 
175 

Ser Ser 

Asp Leu 

Asp lie 

Glu He 
240 
lie Lys 
255 

His lie 

His Gly 

Ser Ser 

lie Tyr 
320 
lie Thr 
335 

lie lie 



<210> 4 
40 <211> 352 

<212> PRT 

<213> Lactococcus Lactis 
<400> 4 



Met Arg 


lie 


Glu 


Lys 


Leu 


Lys 


Val 


Lys 


Met Leu Thr Glu Asn lie Asp 


1 








5 










10 15 


Ser 


Leu 


Leu 


lie 


Thr 


Asp Met 


Lys 


Asn 


lie Phe Tyr Leu Thr Gly Phe 








20 










25 


30 


Ser Gly 


Thr 


Ala 


Gly 


Thr 


Val 


Phe 


Leu 


Thr Gin Lys Arg Asn lie Phe 






35 










40 




45 


Met 


Thr 


Asp 


Ser 


Arg 


Tyr 


Ser 


Glu 


Met Ala Arg Gly Leu lie Lys Asn 




50 










55 






60 


Phe 


Glu 


lie 


lie 


Glu 


Thr Arg 


Asp 


Pro 


He Ser Leu Leu Thr Glu Leu 


65 










70 








75 80 


Ser 


Ala 


Ser 


Glu 


Ser 


Val 


Lys 


Asn 


Met 


Ala Phe Glu Glu Thr Val Asp 










85 










90 95 


Tyr Ala 


Phe 


Phe 


Lys 


Arg 


Leu 


Ser 


Lys 


Ala Ala Thr Lys Leu Asp Leu 








100 










105 


110 


Phe 


Ser 


Thr 


Ser 


Asn 


Phe 


Val 


Leu 


Glu 


Leu Arg Gin lie Lys Asp Glu 






115 










120 




125 


Ser 


Glu 


He 


Ser 


Leu 


lie 


Lys 


Lys 


Ala 


Cys Glu He Ala Asp Glu Ala 




130 










135 






140 


Phe 


Met 


Ser 


Ala 


Leu 


Arg 


Phe 


lie 


Glu 


Pro Gly Arg Thr Glu lie Glu 


145 










150 








155 160 


Val 


Ala 


Asn 


Phe 


Leu 


Asp 


Phe 


Lys 


Met Arg Asp Leu Glu Ala Ser Gly 










165 










170 175 


lie 


Ser 


Phe 


Glu 


Thr 


lie 


Val 


Ala 


Ser Gly Lys Arg Ser Ser Leu Pro 








180 










185 


190 


His 


Gly 


Val 


Ala 


Thr 


Ser 


Lys 


Met 


He 


Gin Phe Gly Asp Pro Val Thr 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



195 


















Tin Aon PHo 
ixc n9{J fiio 


wiy 


- 


iyr 


_ 

iyr 


UlU 


ni9 


Tyr 


Aia oer Asp net Tnr Arg 


210 








215 










Thr T1p Phft 


vox 


fil u 
uiy 


OCX 


VOX 


ASp 


ASp 


Lys 


Met Arg Thr He Tyr Glu 


225 


















ill J. VOX IU(| 


jjys 


Al a 
Ala 


Asn 


(21 n 
UlU 


Ala 


L6U 


T 1 A 

lie 


i»ys Qsin vai Lys Ala Gly 




















Met Thr Tvr 


Ala 


Gin 


iyx 


Asp 


Asn 


He 


Pro 


/*xy uiu vox lie v?xu x#ys 




260 










265 




270 


Ala Asp Phe 


Gly 


Gin 


Tyr 


Phe 


Thr 


His 


Gly 


He Gly His Gly Leu Gly 


275 










280 






285 


Leu Asp Val 


His 


Glu 


lie 


Pro 


Tyr 


Phe 


Asn 


Gin Ser Met Thr Glu Asn 


290 








295 








300 


Gin Leu Arg 


Ser 


Gly 


Met 


Val 


He 


Thr 


Asp 


Glu Pro Gly lie Tyr Leu 


305 






310 










315 320 


Pro Glu Phe 


Gly 


Gly 


Val 


Arg 


lie 


Glu 


Asp 


Asp Leu Leu Val Thr Glu 






325 










330 


335 


Asn Gly Cys 


Glu 


Val 


Leu 


Thr 


Lys 


Ala 


Pro 


Lys Glu Leu He Val He 




340 










345 




350 



<210> 5 
<211> 368 
<212> PRT 

<213> Lactobacillus Delbrueckii Lactis 



<400> 


5 


















Met Asn 


Leu 


Asp 


Lys 


Leu 


Gin 


Asn 


Trp 


Leu 


Gin 


Glu Asn Gly Met Asp 


1 • 






5 










10 




15 


Val Ala 


Tyr 


Val 


Ser 


Ser 


Pro 


Thr 


Thr 


He 


Asn 


Tyr Phe Thr Gly Phe 






20 










25 






30 


He Thr 


Asp 


Pro 


Glu 


Glu 


Arg 


He 


Phe 


Lys 


Leu 


Phe Ala Phe Lys Asp 




35 










40 








45 


Ala Glu 


Pro 


Phe 


Leu 


Phe 


Cys 


Pro 


Ala 


Leu 


Asn 


Tyr Glu Glu Ala Lys 


50 










55 










60 


Ala Ser 


Ala 


Trp 


Asp 


Gly 


Asp 


Val 


Val 


Gly 


Tyr 


Leu Asp Ser Glu Asp 


65 








70 










75 


80 


Pro Trp 


Ser 


Lys 


He 


Ala 


Glu 


Glu 


He 


Lys 


Lys 


Arg Thr Lys Asp Tyr 








85 










90 




95 


Gin Asn 


Trp 


Ala 


Val 


Glu 


Lys 


Asn 


Gly 


Leu 


Thr 


Val Ala His Tyr Gin 






100 










105 






110 


Ala Leu 


His 


Ala 


Gin 


Phe 


Pro 


Asp 


Ser 


Asp 


Phe 


Ser Lys Asp Leu Ser 




115 










120 








125 


Asp Phe 


He 


Ala 


His 


He 


Arg 


Leu 


Phe 


Lys 


Thr 


Glu Ser Glu Leu Val 


130 










135 










140 


Lys Leu 


Arg 


Lys 


Ala 


Gly 


Glu 


Glu 


Ala 


Asp 


Phe 


Ala Phe Gin He Gly 


145 








150 










155 


160 


Phe Glu 


Ala 


Leu 


Arg 


Asn 


Gly 


Val 


Thr 


Glu 


Arg 


Ala Val Val Ser Gin 








165 










170 




175 


He Glu 


Tyr 


Gin 


Leu 


Lys 


Leu 


Gin 


Lys 


Gly 


Val 


Met Gin Thr Ser Phe 






180 










185 






190 


Asp Thr 


He 


Val 


Gin 


Ala 


Gly 


Lys 


Asn 


Ala 


Ala 


Asn Pro His Gin Gly 




195 










200 








205 


Pro Ser 


Met 


Asn 


Thr 


Val 


Gin 


Pro 


Asn 


Glu 


Leu 


Val Leu Phe Asp Leu 


210 










215 










220 


Gly Thr 


Met 


His 


Glu 


Gly 


Tyr 


Ala 


Ser 


Asp 


Ser 


Ser Arg Thr Val Ala 


225 








230 










235 


240 


Tyr Gly 


Glu 


Pro 


Thr 


Asp 


Lys 


Met 


Arg 


Glu 


He 


Tyr Glu Val Asn Arg 








245 










250 




255 


Thr Ala 


Gin 


Gin 


Ala 


Ala 


He 


Asp 


Ala 


Ala 


Lys 


Pro Gly Met Thr Ala 






260 










265 






270 


Ser Glu 


Leu 


Asp 


Gly 


Val 


Ala 


Arg 


Lys 


He 


He 


Thr Asp Ala Gly Tyr 




275 










280 








285 


Gly Glu 


Tyr 


Phe 


He 


His 


Arg 


Leu 


Gly 


His 


Gly 


He Gly Met Glu Val 


290 










295 










300 


His Glu 


Phe 


Pro 


Ser 


He 


Ala 


Asn 


Gly 


Asn 


Asp 


Val Val Leu Glu Glu 


305 








310 










315 


320 


Gly Met 


Cys 


Phe 


Ser 


He 


Glu 


Pro 


Gly 


He 


Tyr 


He Pro Gly Phe Ala 








325 










330 




335 


Gly Val 


Arg 


He 


Glu 


Asp 


Cys 


Gly 


Val 


Leu 


Thr 


Lys Asp Gly Phe Lys 
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340 . 345 350 

Pro Phe Thr His Thr Ser Lys Glu Leu Lys Val Leu Pro Val Lys Glu 
355 360 365 

5 <210> 6 

<211> 25 
<212> PRT 

<213> Staphylococcus Aureus 

10 <400> 6 

Glu Glu Asn Pro Phe He Gly Thr Thr Val Phe Gly Gly Leu Ser Leu 

15 10 15 

Ala Leu Gly Ala Ala Leu Leu Ala Gly 
20 25 



15 



20 



<210> 7 
<211> 23 
<212> PRT 

<213> Staphylococcus Aureus 



<400> 7 

Gly Glu Glu Ser Thr Asn Lys Gly Met Leu Phe Gly Gly Leu Phe Ser 

1 5 10 15 

He Leu Gly Leu Ala Leu Leu 
25 20 

<210> 8 
<211> 24 
<212> PRT 

30 <213> Staphylococcus Sobrinos 

<400> 8 

Asp Ser Ser Asn Ala Tyr Leu Pro Leu Leu Gly Leu Val Ser Leu Thr 
1 5 10 15 

35 Ala Gly Phe Ser Leu Leu Gly Leu 
20 

<210> 9 
<211> 24 
40 <212> PRT 

<213> Enterococcus Faecalis 

<400> 9 

Glu Lys Gin Asn Val Leu Leu Thr Val Val Gly Ser Leu Ala Ala Met 
45 1 5 10 15 

Leu Gly Leu Ala Gly Leu Gly Phe 
20 

<210> 10 s 
50 <211> 23 

<212> PRT 

<213> Streptococcus Pyogenes 
<400> 10 

55 Ser He Gly Thr Tyr Leu Phe Lys He Gly Ser Ala Ala Met He Gly 
1 5 10 15 

Ala He Gly He Tyr He Val 
20 

60 <210> 11 

<211> 22 
<212> PRT 

<213> Listeria Monocytogenes 

65 <400> 11 

Asp Ser Asp Asn Ala Leu Tyr Leu Leu Leu Gly Leu Leu Ala Val Gly 

15 10 15 

Thr Ala Met Ala Leu Thr 
20 
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10 



15 



<210> 12 

<211> 5 

<212> PRT 

<213> Staphylococcus Aureus 

<400> 12 
Arg Arg Arg Glu Leu 
1 5 

<210> 13 
<211> 9 
<212> PRT 

<213> Staphylococcus Aureus 
<400> 13 

Arg Arg Asn Lys Lys Asn His Lys Ala 
1 5 

20 <210> 14 

<211> 5 
<212> PRT 

<213> Staphylococcus Sobrinus 

25 <400> 14 

Arg Arg Lys Gin Asp 

1 5 

<210> 15 
30 <211> 7 

<212> PRT 

<213> Enterococcus Faecal is 
<400> 15 

35 Lys Arg Arg Lys Glu Thr Lys 
1 5 

<210> 16 
<211> 5 
40 <212> PRT 

<213> Streptococcus Pyogenes 

<400> 16 
Lys Arg Arg Lys Ala 
45 1 5 

<210> 17 
<211> 8 
<212> PRT 

50 <213> Actinomyces Viscosus 

<400> 17 

Lys Arg Arg His Val Ala Lys His 

1 5 

55 

<210> 18 

<211> 5 

<212> PRT 

<213> Streptococcus Aglactiae 

<400> 18 
Lys Arg Arg Lys Ser 
1 5 

65 <210> 19 

<211> 6 



60 



<212> PRT 

<213> Streptococcus Pyogenes 
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<400> 19 
Lys Arg Lys Glu Glu Asn 
1 5 

5 <210> 20 

<211> 5 
<212> PRT 

<213> Artificial Sequence 

10 <220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 20 
Arg Arg Arg Glu Ser 
15 1 5 



<210> 21 
<211> 5 
<212> PRT 

20 <213> Artificial Sequence 

<220> 

<223> Mutated derived from Streptococcus Pyogenes 

25 <400> 21 

Arg Arg Arg Ser Leu 
1 5 



<210> 22 
30 <211> 5 

<212> PRT 

<213> Artificial Sequence 



<220> 

35 <223> Mutated derived from Streptococcus Pyogenes 

<400> 22 
Arg Arg Ser Glu Leu 
1 5 

40 

<210> 23 
<211> 5 
<212> PRT 

<213> Artificial Sequence 

45 

<220> 

<223> Mutated derived from Streptococcus Pyogenes 



<400> 23 
50 Arg Ser Arg Glu Leu 
1 5 

<210> 24 
<211> 5 
55 <212> PRT 

<213> Artificial Sequence 



<220> 

<223> Mutated derived from Streptococcus Pyogenes 

60 

<400> 24 
Ser Arg Arg Glu Leu 
1 5 



65 <210> 25 

<211> 5 
<212> PRT 

<213> Artificial Sequence 
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10 



IS 



20 



25 



30 



<220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 25 
Arg Arg Ser Ser Ser 
1 5 

<210> 26 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 26 
Arg Ser Arg Ser Ser 

1 5 

<210> 27 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 27 
Ser Arg Arg Ser Ser 
1 5 



35 



<210> 28 
<211> 1017 
<212> DNA 

<213> Staphylococcus Aureus 



40 



45 



50 



55 



<220> 
<221> CDS 
<222> (716) - 



(1014) 



<400> 28 

cgactctaga ggacaagcaa ctaagcaggc gccaaattat gaaattatta 
tactattatt ggtgagatta aaaaattgct acnccaanaa aattttgaaa 
tgaggggcat catgtnnttt atgatacata ccttgaatta aataaaagcc 
aataagcatt tctaatactg tagataaaat tagagacgtc caagatgctg 
tttaattcaa aaagcagcta atattgttga tgaaacatat gaatatattt 
aaaagcnngc atgactgaaa aanaattaaa ggcaatatta gaaagccaaa 
aggagcagat ggaccntctt tcgatacgat tgtngcatct ggtcctagag 
accatggtgt tgcaagtgat aaaattattg aaaaaggcga catgattaca 
ggcgcnttnt tataaccggc tattgttcaa attttactan aacattttgc 
aacccgatcc ntaaactgaa aagaaattgt agcatctgtt catagaggtg 
tggtgttgca agtgataaaa ttattgaaaa aggcgacatg atnacattag 
gtattataac ggctattgtt cagatattac tagaacattt gctattggag 



atcgtaaatc 
atgttngttt 
gtatatcatt 
acgaaattgc 
taactgttgt 
tgctanaatt 
gtgcattacc 
ttanattttg 
nattgggaaa 
cattaccaca 
atttnggcgc 
aacca gat 
Asp 
1 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
718 



60 



65 



cct aaa ctg aaa gaa ata tat caa ata gta ctt gaa tct caa atg aaa 766 
Pro Lys Leu Lys Glu lie Tyr Gin lie Val Leu Glu Ser Gin Met Lys 
5 10 15 

gca att aat gag att aga cct ggc atg act ggt gca gaa get gat gec 814 
Ala lie Asn Glu He Arg Pro Gly Met Thr Gly Ala Glu Ala Asp Ala 
20 25 30 

att tea aga aac tat tta gag tea aaa ggg tat gga aaa gaa ttt gga 862 
He Ser Arg Asn Tyr Leu Glu Ser Lys Gly Tyr Gly Lys Glu Phe Gly 
35 40 45 



cat tea eta gga cat ggt att ggt tta gaa ate cat gaa ggg cca atg 



910 
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His 


Ser 


Leu Gly 


MIS 


Gly 


He Gly 


Leu Glu He His Glu Gly Pro Met 






50 










55 








60 




65 






ctg 


gnt 


cgt 


acg 


ata 


caa 


gat 


aaa 


ctt 


caa gtt 


aac aac 


tgt gtt aca 


958 


5 


Leu 


Xaa 


Arg 


Thr 


He 


Gin Asp Lys 


Leu 


Gin Val 


Asn Asn Cys Val Thr 














70 










75 




80 






gta 


gna 


cct 


ggt 


gtt 


nat 


ata 


gaa 


ggt 


ttg ggc 


ggt ata 


aga ata gaa 


1006 




Val 


Xaa 


Pro Gly 


Val 


Xaa 


He 


Glu 


Gly Leu Gly Gly He Arg He Glu 




10 








B5 










90 






95 






gag 


ata 


tt taa 


















1017 


1 * 


Glu 


lie 
























15 




<210> 


29 
























<211> 


100 
























<212> 


PRT 




















20 




<213> 


Staphylococcus Aureus 
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85 90 95 

att tct aat atg aca gcg ata gaa cct aag caa att aat att cac att 336 
lie Ser Asn Met Thr Ala lie Glu Pro Lys Gin He Asn He His He 
100 105 110 

aca caa ate gtt att gaa aag taa 360 
Thr Gin He Val He Glu Lys 
115 



<210> 31 

<211> 119 

<212> PRT 

15 <213> Staphylococcus Aureus 

<400> 31 

Met Val Lys Val Thr Asp Tyr Ser Asn Ser Lys Leu Gly Lys Val Glu 
15 10 15 

20 He Ala Pro Glu Val Leu Ser Val He Ala Ser He Ala Thr Ser Glu 
20 25 30 

Val Glu Gly He Thr Gly His Phe Ala Glu Leu Lys Glu Thr Asn Leu 

35 40 45 

Glu Lys Val Ser Arg Lys Asn Leu Ser Arg Asp Leu Lys He Glu Ser 
25 50 55 60 

Lys Glu Asp Gly He Tyr He Asp Val Tyr Cys Ala Leu Lys His Gly 
65 70 75 80 

Asn He Ser Lys Thr Ala Asn Lys He Gin Thr Ser He Phe Asn Ser 
85 90 95 

30 He Ser Asn Met Thr Ala He Glu Pro Lys Gin He Asn He His He 
100 105 110 

Thr Gin He Val He Glu Lys 
115 

35 <210> 32 

<211> 19 

<212> PRT 

<213> Artificial Sequence 

40 <220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 32 

His His His His His His Ala Gin Ala Leu Glu Pro Thr Gly Glu Glu 
45 1 5 10 15 

Asn Pro Phe 

<210> 33 

50 <211> 1119 

<212> DNA 

<213> Streptococcus Pyogenes 
<220> 

55 <221> CDS 

<222> (1)... (995) 

<400> 33 

atg eta caa tat tct caa aag tta cca aag gag ttc gcg atg tea gga 48 

60 Met Leu Gin Tyr Ser Gin Lys Leu Pro Lys Glu Phe Ala Met Ser Gly 
15 10 15 

ttt tta gaa caa cga tta ggt cac tgc eta agg cag atg gca gag aag 96 

Phe Leu Glu Gin Arg Leu Gly His Cys Leu Arg Gin Met Ala Glu Lys 
65 20 25 30 

ggg eta gag get ctt eta gtc ace cat tta ace aat agt tat tac ttg 144 
Gly Leu Glu Ala Leu Leu Val Thr His Leu Thr Asn Ser Tyr Tyr Leu 
35 40 45 



-10- 



aca ggt ttt tct gga act gca gca act gtt ttg ata acg gcc aaa cgt 
Thr Gly Phe Ser Gly Thr Ala Ala Thr Val Leu lie Thr Ala Lys Arg 
50 55 60 

cgt gtt ttg ate aca gat tea cgt tat ace ttg ctt get aaa get agt 
Arg Val Leu He Thr Asp Ser Arg Tyr Thr Leu Leu Ala Lys Ala Ser 
65 70 75 80 

gtt gag gga ttt gat att ate gaa age cgc acg ccg ctt aag gtt gtg 
Val Glu Gly Phe Asp He He Glu Ser Arg Thr Pro Leu Lys Val Val 
85 90 95 

gca gaa ttg tta gag get gat caa ata gat tgc ctt ggt ttt gag gac 
Ala Glu Leu Leu Glu Ala Asp Gin He Asp Cys Leu Gly Phe Glu Asp 
100 105 110 

cag gta teg ttt tct ttt tac cag gcc atg caa gca gaa ctg tea gga 
Gin Val Ser Phe Ser Phe Tyr Gin Ala Met Gin Ala Glu Leu Ser Gly 
115 120 125 

ata acc ttg ctt get cag tea ggt ttt gtg gag cat tta cgt ctt att 
He Thr Leu Leu Ala Gin Ser Gly Phe Val Glu His Leu Arg Leu He 
130 135 140 

aag gac gcc tct gaa ate gat acc att get aaa gcg tgc teg ate tea 
Lys Asp Ala Ser Glu He Asp Thr He Ala Lys Ala Cys Ser He Ser 
145 150 155 160 

gac aaa gca ttt gaa gat get ctt gat ttt att aaa cca ggg aca acc 
Asp Lys Ala Phe Glu Asp Ala Leu Asp Phe He Lys Pro Gly Thr Thr 
165 170 175 

act gaa cgt gac ctg get aat ttt tta gat ttt cgt atg cgt cag tat 
Thr Glu Arg Asp Leu Ala Asn Phe Leu Asp Phe Arg Met Arg Gin Tyr 
180 185 190 

ggt gcc age ggc aca tea ttt gat ate att gta get tea ggc tat etc 
Gly Ala Ser Gly Thr Ser Phe Asp He He Val Ala Ser Gly Tyr Leu 
195 200 205 

tct gcc atg cct cat gga cgc gcc agt gac aag gtt ate cag aat aaa 
Ser Ala Met Pro His Gly Arg Ala Ser Asp Lys Val He Gin Asn Lys 
210 215 220 

gag age ttg acc atg gac ttt ggg tgt tac tac aat cac tat gtt agt 
Glu Ser Leu Thr Met Asp Phe Gly Cys Tyr Tyr Asn His Tyr Val Ser 
225 230 235 240 

gat atg acg agg acc att cat att cat att ggc caa gtt act gat gaa 
Asp Met Thr Arg Thr lie His He His He Gly Gin Val Thr Asp Glu 
245 250 255 

gaa cgt gag att tat get ctt gtt ctt get get aat aag get tta att 
Glu Arg Glu He Tyr Ala Leu Val Leu Ala Ala Asn Lys Ala Leu He 
260 265 270 

get aaa get age get ggc atg act tat agt gac ttt gac ggt att ccg 
Ala Lys Ala Ser Ala Gly Met Thr Tyr Ser Asp Phe Asp Gly He Pro 
275 280 285 

cgc caa etc ate act gag gcg ggt tat ggc agt cgc ttc aca cat ggc 
Arg Gin Leu He Thr Glu Ala Gly Tyr Gly Ser Arg Phe Thr His Gly 
290 295 300 

att ggt cat ggc ate ggg ctt gac ate cat gag aat cca ttt ttt ggg 
He Gly His Gly He Gly Leu Asp He His Glu Asn Pro Phe Phe Gly 
305 310 315 320 
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aaa tct gag caa ctt etc caa get gga atg gtg gt aacagatgag 1005 
Lys Ser Glu Gin Leu Leu Gin Ala Gly Met Val 
325 330 

5 ccaggtatct atttggataa caaatatggt gtccgtattg aagatgactt ggttatcaca 1065 
aaactggctt gtcaagtctt gaccttggca cccaaagaat taattgtatt gtaa 1119 
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